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ABSTRACT 
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presents the results of a survey of Association of Research Libraries (ARL) 
member libraries. The survey addressed the following questions about numeric 
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are libraries devoting to numeric data? and (5) How do libraries evaluate 
their own performance regarding numeric data? The survey was divided into six 
topical areas: background; numeric data product collections; access to 
machine-readable numeric data products; staffing; budget; and evaluation of 
numeric data collections and services. A total of 66 of 121 ARL member 
libraries responded to the survey. A copy of the questionnaire with tabulated 
results is presented. Representative documents include collection policies, 
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Executive Summary 



Introduction 

As scholarship places increasing emphasis on 
quantitative analysis, the need for numeric data 
grows across academic disciplines. Students and 
faculty require better access to sources of numeric 
data ranging from simple descriptive statistics to 
large sets of raw data. A "numeric data product" 
may be defined as any information resource — print 
or nonprint — with considerable numeric content. 
This definition focuses on the content of numeric 
data products, irrespective of format. As is evident 
from the survey results, many perceive data as 
strictly limited to machine-readable files. Such a 
perception ignores the fact that data comes in a wide 
variety of formats. 

Academic departments, campus computing 
centers, and libraries have all provided support to 
varying degrees for researchers using numeric data. 
While the demand grows, specialized data services 
are quickly becoming a major operation. On many 
campuses, the library has become a central access 
point for numeric data users by coordinating collec- 
tions of data resources and services to researchers. 

As more academic libraries join the data game, they 
look to their peer institutions for models of service, 
collection development, access policies, budgets, and 
staffing patterns. 

Academic libraries have historically collected and 
provided access to bibliographic sources that contain 
some amount of numeric data. Libraries that are also 
government document depositories have often 
served as gateways to numeric data, as many of the 
publications distributed through depository pro- 
grams are statistical in nature. The emergence of 
major statistical indexes has helped libraries, both 
depository and nondepository, provide improved 
access to sources of numeric data. Such products as 
the American Statistics Index , published by the 



Congressional Information Service, allow users to 
identify and access publications that contain sub- 
stantial numeric content. 

As with bibliographic resources, the demand 
among numeric data users for immediate, full-text 
access is on the rise. Since the release of 1990 Census 
data on CD-ROM, academic institutions have faced 
an increasing demand for numeric data, especially in 
a machine-readable form. Data consortia, such as the 
Inter-university Consortium for Political and Social 
Research (ICPSR), have also provided more wide- 
spread access to numeric data by archiving and 
distributing data files. 

The survey results indicate that most academic 
libraries collect and service numeric data, but a 
disparity exists in how they define numeric data, 
what specific services are provided, what levels of 
staffing and budget are needed, and how they 
evaluate data collections and services. It is hoped 
that the survey results will help identify some best 
practices, assuage some fears, and foster collabora- 
tion among institutions and individuals working to 
provide numeric data services. 

Survey 

While many academic libraries collect and provide 
services for numeric data resources, little has been written 
about how such resources are selected or which specific 
services are provided. The investigators have attempted to 
find answers to several specific questions about numeric 
data in academic libraries. What relationships exist 
between libraries and other academic units that have 
responsibility for numeric data? What are the principle 
sources for numeric data? How much are libraries 
spending to acquire numeric data resources? How much 
staff time are libraries devoting to numeric data? How do 
libraries evaluate their own performance regarding 
numeric data? The investigators designed their survey to 
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try to capture some of this information. 

During the spring of 2001, an eighteen-question 
survey was distributed to ARL member institutions 
in the United States and Canada. The survey was 
divided into six topical areas: Background, Numeric 
Data Product Collections, Access to Machine- 
readable Numeric Data Products, Staffing, Budget, 
and Evaluation of Numeric Data Collections & 
Services. In addition to basic information gathering 
about data collections and services, respondents 
were presented with an opportunity to rank their 
service levels based on desired and perceived 
performance. The investigators draw from the 
LibQUAL+ methodology recently developed by ARL. 
Sixty-six of the 121 ARL member libraries (55%) 
responded to the survey. 

Responsibility for Data 

Sixty-four out of 66 libraries (97%) indicated they 
collect or provide access to numeric data. All of these 
respondents provide some assistance for users trying to 
locate numeric data. Apart from helping users locate data, 
the most frequently reported functions among libraries 
were: cataloging numeric data (98%), providing access to 
data (98%), collecting numeric data products (97%), 
storing numeric data (95%), answering reference ques- 
tions about data (95%), and assisting users to retrieve data 
(92%). 

The survey asked respondents to indicate which of the 
other campus units also perform various functions 
associated with data services. When it comes to assisting 
users with data analysis, about a third (31%) of libraries 
provide this service, while close to half of the respondents 
(45%) indicated that academic departments perform this 
function. This seems to correspond with the typical 
limitation many academic librarians place on reference 
service; namely, to help users get to information, but leave 
the interpretation of that information to the user. Other 
less frequently reported functions performed by libraries 
include: archiving data (53%), developing web-based 
tools to access numeric data (52%), and collecting 
codebooks (78%). 

Among the other campus units, academic departments 
appear to provide a significant share of data services. 
Fifty-nine percent of respondents indicated that depart- 
ments also collect data products, 47% said they store data. 



36% said they provide access to data, and 31% said they 
provide user instruction. Overall, libraries appear to take 
the lead in handling numeric data. It should be noted, 
however, that this trend in responses is very likely a result 
of the fact that all the organizations surveyed were 
libraries. If a similar survey had been administered to the 
other campus units, a similar skew would probably have 
resulted. Nevertheless, the trend is telling, as libraries 
often are not aware of what other units on campus are 
doing with regard to data. Libraries considering providing 
more data-related services would do well to investigate 
what the other units are doing, and develop partnerships 
with them. 

While 48% of responding libraries indicated they have 
no formal relationship with another academic unit 
responsible for numeric data, over a third (39%) hold both 
data and codebooks that are used by the data unit, and 
34% include the data unit as an administrative part of the 
library. Among the explanatory responses, the majority of 
those commenting indicated they had an informal but 
strong relationship to other units, whether or not they are 
within the library. 

Data Collection Development 

Obtaining information about the number of data 
products libraries are collecting proved to be very diffi- 
cult. On average, only about a third of the respondents 
were able to provide any numbers in response to this 
question. An overwhelming number of respondents 
indicated that approximating the number of products 
collected is difficult at best and perhaps impossible, as 
data products are distributed in various forms and from 
various sources. 

Of those that could respond (22 out of 64), about 
half (48%) indicated they receive between 100-500 
data products through a government depository 
program, 13% said they got 3,000 or more. Two- 
thirds (67%) said they receive 50 or fewer products 
via international governmental organizations (such 
as the UN, IMF, and OECD). About half (48%) 
indicated they received 20 or fewer products from 
commercial sources. Given the difficulty in answer- 
ing this question, perhaps some other method of 
estimating the percentages of the library's total 
collection received from these various sources, rather 
than exact numbers, could shed better light. 
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The majority of respondents (89%) indicated their 
institutions were members of the ICPSR. Of those, 60% 
indicated that their official representative to the ICPSR 
was a member of the library staff. Over a quarter (27%) 
said their institutions participated with the Roper Center 
for Public Opinion Research. Other consortia included the 
European Consortium for Political Research, R-Cade, 
Statistics Canada’s Data Liberation Initiative, 
Sociometrics’ Electronic Social Science Data Library and 
statewide consortia. 

Over a quarter (27%) of respondents have a 
collection development policy covering numeric 
data. A preponderance (98%) of libraries collects 
numeric data in the social sciences. Over three* 
quarters (77%) collect in the sciences, while a little 
over half (56%) collect numeric data in the humani- 
ties. Business was mentioned frequently as another 
distinct subject area, while others included environ- 
ment, public health, agriculture and law. 

With regard to formats in which data is collected, 
online (98%) and CD-ROM (97%) are in the majority, 
followed by paper (81%), diskette (75%), microform 
(70%), and even magnetic tape (23%). The only other 
format mentioned was DVD, which reflects an 
emerging format of choice. Some respondents also 
commented that data only comes electronically, 
while differentiating that from statistics, which are 
acquired in print, microform etc. This is an interest- 
ing distinction to make as it reflects a prevailing 
notion that data is something relegated only to the 
world of computers. 

Machine-readable Data 

Since machine-readable numeric data poses particular 
issues for access and bibliographic control, two questions 
were asked specifically concerning this format. Nearly all 
responding libraries (97%) provide cataloging of these 
numeric data resources in their online public-access 
catalogs, and just under a quarter (23%) have some other 
specialized catalog specifically for data. Many respon- 
dents indicated they rely on lists presented on web pages, 
and several respondents said they had some sort of 
cataloging scheme in development. 

Thirty-four percent of respondents indicated they have 
no formal policy with regard to patron access to machine- 
readable data. More than half (58%) said that access is 



freely available to the institution’s community, both on 
and off campus. Only a third (33%) restricted access to 
students, faculty, and staff. Over half of the respondents 
noted that walk-in users have access to their resources, 
both locally mounted and remotely accessed. Since there 
is a high degree of variability among license agreements 
(especially with regard to commercially produced data 
resources) nearly every data source, in effect, has a 
different access policy. Government document resources 
were often singled out as having no access restrictions. 

Staffing and Budgeting 

Just over a third of the responding libraries (38%) 
reported they employ someone specifically to acquire 
numeric data products. Of those, 79% hold the M.L.S. 
degree or its equivalent. Close to half (48%) said they 
employ someone specifically to handle service requests 
for numeric data, 65% of whom hold the M.L.S degree or 
its equivalent. Among other credentials held by these 
individuals are Bachelor’s and advanced degrees in 
political science, economics, and statistics. 

The most commonly provided job title was Data 
Librarian. Judging from the responses that provided 
the same title for both acquisitions and reference 
service responsibility, 13 of the above individuals 
(54%) are responsible for both areas. Many respon- 
dents expressed concern over use of the term "spe- 
cifically," reporting that the tasks of acquiring and 
providing service for numeric data are often distrib- 
uted among several librarians and staff, without 
having any one individual specifically charged with 
this responsibility. On average, libraries reported 
that they devoted 2 full time equivalent (FTE) 
librarians, 1 FTE support staff, and 1 FTE student 
staff to these responsibilities. 

The same difficulty in determining the number of data 
products in the collection reappears when trying to 
estimate a budget for data products. Many respondents 
indicated that it is difficult to derive a budgeted amount 
since numeric data is scattered throughout their collec- 
tions. Only 27 respondents out of 64 (42%) could provide 
some specific budget information. For those giving 
figures, the budgeted amounts range from zero to 
$200,000. The average amount is $31,825 with a median 
of $12,000. The majority (89%) budget less than $100,000 
and 30% budget below $2,000. 
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Evaluation of Data Services 

Respondents were asked to indicate what tech- 
niques or measures they used to evaluate the quality 
of their data collections and services. Nearly half 
(45%) admitted they do not perform any formal 
evaluation. Of those that did perform some evalua- 
tion, most (69%) preferred usage statistics to measure 
the quality of their data services. 

Working with the LibQUAL+ methodology, 
respondents were asked to rank their numeric data 
collections and services on a nine-point Likert scale, 
with 1 being the lowest level of service and 9 the 
highest. Seven aspects of data collections and 
services were addressed: convenient access to data, 
providing reference services, acquiring data prod- 
ucts, creating partnerships with other data units, 
collecting user feedback, adding products on re- 
quest, and maintaining error-free catalog records. 

For each category, respondents were asked to 
indicate what they thought should be a minimum 
service level, a desired service level, and the per- 
ceived performance level at their own library. 

For the most part, respondents tended to rank the 
minimum level of service at around 4-6, the desired 
level at 7-8, and the perceived level of performance 
generally landed somewhere in between. A look at 
the difference between average rankings for each of 
the three service levels reveals some interesting 
patterns and could indicate how well libraries are 
performing. In the area of providing convenient 
access to data products, the average ranking for a 
minimum level of service is 5.32, the desired level is 
8.23, and the perceived level is 5.80. The relatively 
high desired level of service would indicate that 
libraries consider convenient access to be important. 
Since the perceived level of performance is fairly 
close to the minimum level, one could say that 
libraries are providing convenient access at about the 
minimal level of service. In other words, there is 
some room for improvement. 

Similarly, looking at the difference between 
rankings across the various categories could offer 
some insight into how libraries value the different 
aspects of data collections and services. With regard 
to providing error-free catalog records, the average 
rank for minimal service is 6.17, the desired level is 



8.55, and the perceived level is 6.60. Since the 
average minimal level and the average desired level 
are both higher for cataloging than they are for 
convenient access, one might infer that the libraries 
surveyed generally value accurate cataloging 
somewhat more highly than convenience. In fact, 
this seems consistent with the findings above that 
show most libraries indicate they catalog numeric 
data products and provide cataloging for machine- 
readable data as well. 

With regard to collecting user feedback, the 
average minimum level is 4.56, the desired level is 
7.54, and the perceived level is 4.00. These numbers 
would indicate that not only do the libraries seem to 
value collecting feedback less than they value 
cataloging, but also that they are not collecting 
feedback as well as they think they should. This 
seems entirely consistent with the earlier findings 
that show almost half of the libraries do not employ 
any specific measure to evaluate their collections and 
services. Likewise, in the area of creating partner- 
ships, the minimum service level is 4.86, the desired 
is 7.80, and the perceived is 5.56. This would indicate 
that libraries may not value the creation of partner- 
ships as highly as other aspects, and that they are a 
bit more satisfied with their performance in this area. 
Again this appears to be consistent with the findings 
above that show about half have no formal relation- 
ships with the other data units on campus, but seem 
to rely somewhat on informal arrangements. 

Issues Raised 

The results of this survey raise several important issues 
for libraries to consider while planning to create or expand 
numeric data collections and services. Among these are: 
creating a working definition of numeric data, determina- 
tion of services to be provided, relationships with other 
campus data units, staffing and budget, access policies, 
and evaluation mechanisms. 

Perhaps the most fundamental place to begin thinking 
about data collections and services is a decision on how 
the library will define what it means by “numeric data.” 
Many respondents aired their views regarding this 
survey's definition of data. Some found it confusing while 
others found it out of conformity with traditional defini- 
tions. Interestingly, whereas some indicated that numeric 
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data has never been thought of as a separate, distinct part 
of the collection, others pointed out that numeric collec- 
tions are very different from traditional library collections. 
The result was that both groups found some questions 
difficult to answer. While the precise definition is cer- 
tainly open to debate, the crucial matter is for libraries to 
have some understanding of what they will consider data 
to be, and to create or modify their collection development 
policies and service plans appropriately. 

Libraries must decide for themselves what 
specific services they will provide with respect to 
data. Will they only provide reference assistance; 
limiting themselves to guiding users to sources of 
data and helping them copy or download? Will they 
take the next step and provide assistance in analyz- 
ing and presenting data in meaningful ways? Will 
they place data files on a server and create an 
interface for users to access the data? Will they only 
hold the data products and leave users to handle 
things on their own? Whatever services libraries 
decide to provide, they should have a firm sense of 
what these will be before proceeding. 

Libraries should investigate what other units on 
campus are doing with regard to data. Has the 
academic computing department set up a data lab 
for students? Might someone there have expertise in 
data analysis to whom the library could refer users? 
Does the library have a subscription to a service that 
could help the data lab locate additional sources? 
Collaborative relationships between the library and 
other data units would go a long way towards 
providing more cohesive service to the university's 
researchers. They could also help spread the costs so 
these departments might not feel the pinch as hard 
as if they operated alone. 

Staffing and budgeting issues must also be 
settled. Once libraries know what services they plan 
to provide, they can determine what qualifications 
their staff would require to carry them out. Do they 
need to have a data librarian specifically charged to 
handle data, or could these functions be distributed 
among several librarians? Would they need to hire 
someone with a statistics background? The same is 
true for determining what kind of equipment the 
library would need and, consequently, how much 
they will need to spend. Will they need a new server, 



cabinets for CD-ROMs, extra terminals? 

The library should decide what limitations they 
might place on access to data resources. Many 
licensing agreements require that access be limited 
only to university affiliates, while most government- 
issued resources are freely accessible. There are 
various, and sometimes conflicting, obligations 
imposed upon such resources that make creation of 
an access policy difficult. Will the library require a 
sign-in sheet for some resources and leave others 
open? Will they only provide web access to free 
resources while limiting use of commercial resources 
to on-site use only? At a time when the software 
industry is seeking stricter enforcement of their 
intellectual property rights, licensing restrictions on 
data products are becoming a very hot issue. 

Finally, one issue that is often put on the back 
burner is evaluation. Many of the respondents do 
not have a mechanism for evaluating the quality of 
their collections or services, but having some method 
of evaluation is key for any future planning. Librar- 
ies might consider providing for an additional 
column on their reference statistics sheets to cover 
data-related questions. They could use server 
statistics to show how much usage their online data 
products are getting. A focus group could help show 
what other products or services users need. These 
techniques are also especially useful to aid libraries 
in deciding if they want to provide any specific data- 
related services at all. 

Conclusion 

Numeric data is quickly becoming an area for aca- 
demic libraries to venture into, as more emphasis in 
scholarship is placed on quantitative analysis. As with 
other more traditional bibliographic resources, numeric 
data is available in a variety of formats, each of which 
bringing its peculiar limitations and requirements. 

Libraries planning to enter the data game would do well to 
consider the issues raised by this survey as they proceed. 

A review of the accompanying documents provided by 
respondents may help others to identify some examples of 
good practices, as well as become more aware of the 
collections and services other institutions offer and some 
of the challenges they have encountered. 



SPEC Kit 263 

Numeric Data Products and Services 



Survey Results 



Many libraries, particularly those with Government Document Depositories, have long provided access to 
numeric data. The use of numeric data is burgeoning across many academic disciplines. Research libraries 
have extensive experience in acquiring and assisting patrons with machine-readable bibliographic databases, 
but what, if anything, are they doing with machine-readable numeric data products? Are they acquiring 
them? Do they provide any service beyond pointing researchers to them? How successful are such services 
in connecting researchers with data? 

The purpose of this survey is to identify which research libraries in the United States and Canada are 
working with numeric data products and to what extent. For the purpose of this survey numeric data 
product is defined as any information resource, print or non-print, with considerable numeric content. It is 
hoped that the survey results will offer some best practices, assuage some fears and foster collaboration 
among institutions and individuals working in this area. 

This survey was designed by Michael Cook, Social Sciences Bibliographer, Cornell University; John 
Hernandez, Librarian for U.S. Documents, Politics, and Public Administration, New York University; and 
Shawn Nicholson, State Documents and Social Sciences Librarian, Michigan State University. 

Please submit this survey and send the requested documentation by May 11, 2001. As always, individual 
responses to the survey will be treated confidentially. 



Note: Sixty-six of the 121 ARL member libraries (55%) responded to the survey. 

Background 

1. Does your institution collect and/or provide access to numeric data products? (n=66) 

Yes 64 97% 

No 2 3% 
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2. 



In the chart below, indicate which units in your institution are responsible for each numeric data 
activity. Check all that apply. (n=64) 







Other Data Units 




Library 


Computer 

Center 


Data 

Archive 


Academic 

Dept. 


Other 


Collects numeric data products 


62 


8 


14 


38 


11 


Stores numeric data 


61 


17 


14 


30 


10 


Archives numeric data 


34 


10 


11 


15 


7 


Catalogs numeric data 


63 


1 


7 


0 


3 


Provides access to numeric data 


63 


13 


14 


23 


12 


Collects codebooks 


50 


5 


12 


9 


6 


Catalogs codebooks 


49 


0 


6 


0 


1 


Provides access to codebooks 


54 


6 


12 


9 


5 


Provides user instruction for numeric 
data 


51 


16 


12 


20 


5 


Answers reference questions about 
numeric data 


61 


4 


9 


12 


5 


Refers questions to other data units 


51 


16 


13 


24 


9 


Assists users to locate data 


64 


4 


13 


10 


8 


Assists users to retrieve data 


59 


10 


14 


13 


10 


Assists users with data analysis 


20 


18 


9 


29 


10 


Develops web-based tools for 
accessing numeric data 


33 


8 


10 


7 


9 


Other 


7 


0 


1 


0 


0 



Please explain: Many respondents indicated that various institutes and even individual faculty 
members hold data. Working within cooperatives/consortia provides access to more data and services. 



If the library has responsibility for any of the above activities, please complete the survey. 
If not, please submit the survey now. 



3. Which of the following statements best describes the relationship between the library and other data 
service units? Check all that apply. (n=64) 



31 

22 

3 

1 



48% The library and the data unit are separate — they have no formal relationship. 

34% The data unit is an administrative unit within the library. 

5% The data unit is housed within the library, but is not formally a part of the library. 

2% The library refers all requests for numeric data products and services to the data 
unit. 





25 



39% The library holds both numeric data products and codebooks that are used by the 
data unit. 

11 17% The library holds a collection of numeric data products that are used by the data 

unit. 

1 2% The library holds a collection of codebooks, but not numeric data products, that are 

used by the data unit. 

25 39% Other 

Please explain: The majority of those commenting indicated an "informal" but 

strong relationship to other units, whether they are within or outside the library. 



Numeric Data Product Collections 

If the library collects numeric data products, please continue to question 4. If not, please skip to question 8. 

4. Approximately, how many numeric data products are collected from the following suppliers? (n=22) 

Government Document Depository Program: 48% receive 100-500 data products, 13% get 3,000 or 
more. 

International government organizations (e.g. UN, OECD, IMF, etc.): 67% receive 50 or fewer 
products. 

Data consortia (e.g. ICPSR, Roper, etc.): not coded 
Commercial vendors: 48% receive 20 or fewer products. 

Faculty: not coded 
Other: not coded 

Please explain: An overwhelming number of respondents suggested that approximating the 
number of products collected is difficult at best and perhaps impossible. The following is a typical 
response in this vein: "Difficult to quantify since our data products are distributed in various forms 
(e.g., online and CD-ROM databases, locally-loaded data files, CD-ROM, print, microfiche, web), 
and from various sources (e.g., federal, state, and international government." 

5. In what format(s) are numeric data products acquired? Check all that apply. (n=64) 



Online from remote source 


63 


98% 


CD-ROM 


62 


97% 


Paper 


52 


81% 


Diskette 


48 


75% 


Microform 


45 


70% 


Magnetic tape 


15 


23% 


Other 


6 


9% 



Please explain: The only other format mentioned was DVD. Some felt that "data" only 
comes electronically. "Statistics," on the other hand, will be acquired in print, microform etc. 
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In what subject areas does the library collect numeric data? Check all that apply. (n=64) 



Social Sciences 
Sciences 
Humanities 
Other 



63 98% 

49 77% 

36 56% 

15 23% 



Please explain: Business data was mentioned frequently. Other areas included 
environmental, public health, agricultural, and law. 

Does the library have a collection development policy for numeric data? (n=64) 

Yes 17 27% 

No 47 73% 



Does your institution participate in the Inter-university Consortium for Political and Social Research 
(ICPSR)? (n=64) 

Yes 57 89% 

No 7 11% 

If yes, is the Official Representative (OR) to the ICPSR a member of the library staff? (n=57) 

Yes 34 60% 

No 23 40% 



Please indicate any other data related consortia with which your institution is affiliated. (n=64) 

Roper Center for Public Opinion Research 17 27% 

Council of European Social Science Data Archives (CESSDA) 0 0% 

Networked Social Science Tools and Resources (NESSTAR) 0 0% 

Other 16 25% 

Please explain: Other consortia listed included European Consortium for Political Research, 
R-Cade, Statistics Canada's Data Liberation Initiative, Sociometrics' Electronic Social Science Data 
Library and statewide consortia. 



Access to Machine-readable Numeric Data Products 



10. Which of the following statements best describe the library's policy on access for machine- readable 
numeric data? Check all that apply. (n=64) 



Walk-in users may access web-based or remote data while in the library 
Access is freely available to the institution’s community, on and off campus, for local 
and remote data 

Walk-in users may access networked, locally mounted data while in the library 
There is no formal access policy 

Access is restricted to students, faculty & staff use only 

Access is restricted to specific user groups only (e.g. graduate students, faculty, etc.) 
Other 

Please explain: There is a high degree of variability in licensing; thus, nearly every 
data source has a different access policy. A preponderance of respondents noted that walk-in users 
have access to most all resources. Government Documents were often singled out as having virtually 
no access restrictions. 



38 


59% 


37 


58% 


37 


58% 


22 


34% 


21 


33% 


2 


3% 


16 


25% 



11. If machine-readable numeric data products are cataloged, how are catalog records made available to 



users? Check all that apply. (n= 


=64) 




OPAC 


62 


97% 


Specialized catalog for data 


15 


23% 


Card Catalog 


0 


0% 


Other 


12 


19% 



Please explain: Many respondents relied on lists presented on web pages. Several 
respondents had some sort of cataloging scheme in development. 



Staffing 

12. Does the library employ someone specifically to acquire numeric data products? (n=64) 

Yes 24 38% 

No 40 62% 

If yes, please provide that person's title. (n=21) 

Most titles reported were variations of Data Librarian or Data Services Librarian. 

Does this person hold the M.L.S. degree or its equivalent? (n=24) 

Yes 19 79% 

No 5 21% 
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What other relevant degree(s) does this person hold? (n=16) 

These positions are filled primarily with individuals possessing advanced degrees, but a few 
libraries have Bachelors level employees. Among the degrees reported were: 

MA Political Science 

MA Eastern European Area Studies 

MA Geography 

MA Anthropology 

MA English Literature 

MS Statistics 

MS Computer Science 

BA Social Sciences (Economics) 

BA Political Science 
BA History 



13. Does the library employ someone specifically to handle service requests for numeric data? (n=64) 

Yes 31 48% 

No 33 52% 

Does this person hold the M.L.S. degree or its equivalent? (n=31) 

Yes 20 65% 

No 11 35% 

What other relevant degree(s) does this person hold? (n=15) 

As with Question 12, these positions are filled primarily with individuals possessing advanced 
degrees, but a few libraries have Bachelors level employees. Degrees held by a person different from 
the one in Question 12 include: 

MA History 
MA Political Science 
MA Mathematics 
Ph.D. Political Science 
BA Economics 
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Please indicate the number of staff (FTE) in each category specifically charged with providing data 
services. 





Minimum 


Maximum 


Mean 


Median 


Standard Deviation 


Librarians (n=50) 


0 


25 


2.01 


1.0 


4.40 


Support Staff (n=32) 


0 


4 


.94 


.9 


1.01 


Students (n=24) 


0 


12 


1.01 


.25 


2.44 


Other (n=20) 


0 


2 


.60 


.75 


.57 



Please explain: Most respondents replied that a diverse group provided service and 



reference to their communities without being "specifically charged" to do so. 



Budget 

15. Approximately, what amount is budgeted for the acquisition of numeric data products within the 
library? (n=27) 



Minimum 


Maximum 


Mean 


Median 


0 


$200,000 


$31,825 


$12,000 



Range 


n 


0-1,999 


8 


2,000-9,999 


4 


10,000-99,000 


12 


100,000-200,000 


3 



Evaluation of Numeric Data Collections & Services 

16. Please explain which techniques and/ or measures (e.g. user surveys, usage statistics, comparison 
with peer institutions, etc.) your library uses to evaluate the quality of your numeric data collections 
and/ or services. (n=35) 

A preponderance of respondents report they do not perform any type of analysis. Those who do, 
use usage statistics and user surveys, informal feedback from faculty and students, and comparisons 
with peer institutions to measure the quality of data services. 

17. We would like your impressions about the quality of your library's numeric data collections and 
services relative to your expectations. Please think about the two different levels of expectations 
as defined below: 



Minimum Service Level: the minimum level of service performance you consider adequate for a 

research library to provide. 

Desired Service Level: the level of service performance you desire from a research library. 

ERIC 



20 20 



For each of the following statements, please indicate: (a) the minimum service level a research 
library should offer by choosing one of the numbers in the first column; (b) the desired research 
library service level by choosing one of the numbers in the second column; and (c) your 
perception of your library’s actual service level by choosing one of the numbers in the third 
column. 





The Minimum Research 
Library Service 
Level Is: 


When it comes to... 


Mean 


Std Dev 


n 


Convenient access 
to numeric data 
collections 


5.32 


1.82 


60 


Providing 
reference services 
for numeric data 


5.77 


1.93 


60 


Acquiring numeric 
data products 


5.19 


1.77 


58 


Creating 

partnerships with 
other data units 


4.86 


1.94 


55 


Collecting user 
feedback on 
numeric data 
products and 
services 


4.56 


1.99 


59 


Adding to numeric 
data collections on 
request 


5.42 


2.26 


59 


Maintaining error- 
free catalog records 
for numeric data 


6.17 


2.20 


60 



The Desired Research 
Library Service 
Level Is: 


Mean 


Std Dev 


n 


8.23 


1.05 


60 


8.28 


1.11 


60 


8.05 


1.16 


58 


7.80 


1.89 


55 


7.54 


1.60 


59 


8.21 


.92 


57 


8.55 


1.08 


60 



My Perception of My Library's 
Performance Is: 


Mean 


Std Dev 


n 


5.80 


2.00 


59 


6.37 


1.91 


59 


6.29 


1.61 


58 


5.56 


2.34 


54 


4.00 


2.24 


58 


6.32 


2.19 


57 


6.60 


1.94 


58 
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Additional Comments 

18. Please submit any additional information regarding the collection, use, or dissemination of numeric 
data at your institution that may assist us in accurately analyzing the results of this survey. 

Many respondents aired their views regarding this survey's definition of numeric data. Some found 
it confusing while others found it out of conformity with traditional definition of a clear demarcation 
between raw data and statistics or already massaged raw data. 

Interestingly, whereas some indicated that numeric data has never been thought of as a separate, 
distinct part of the collection, others pointed out that numeric collections are very different from 
traditional library services. The result was that both groups found that the questions difficult to 
answer. 

Some respondents rightly pointed out the difficulty in knowing what others are doing with numeric 
data, simply due to the diffusion of services across many units. 

Lastly, several respondents suggest that their service to numeric data is in its infancy — 0-2 years 
old — thereby making it difficult to answer many of the questions. 



Responding Institutions 



Auburn University 
Boston College 

University of British Columbia 
University of Califomia-Davis 
University of California-Irvine 
University of California-San Diego 
Canada Institute for Scientific and Technical 
Information 

Case Western Reserve University 

Colorado State University 

Cornell University 

Duke University 

Emory University 

University of Florida 

George Washington University 

Georgetown University 

University of Georgia 

Georgia Institute of Technology 

Harvard University 

University of Hawaii 

University of Iowa 

Iowa State University 

Johns Hopkins University 

University of Kentucky 

Laval University 

McMaster University 

University of Manitoba 

University of Maryland 

University of Massachusetts 

Massachusetts Institute of Technology 

University of Miami 

University of Michigan 

Michigan State University 



University of Minnesota 

National Library of Canada 

New York University 

University of North Carolina 

North Carolina State University 

Northwestern University 

Ohio University 

University of Pennsylvania 

University of Pittsburgh 

Princeton University 

Rice University 

University of Rochester 

Rutgers University 

Smithsonian Institution 

Southern Illinois University 

State University of New York at Albany 

State University of New York at Buffalo 

Syracuse University 

Temple University 

University of Tennessee 

University of Texas 

University of Toronto 

Vanderbilt University 

University of Virginia 

Virginia Tech 

University of Washington 

Washington State University 

Washington University 

University of Waterloo 

Wayne State University 

University of Western Ontario 

Yale University 

York University 
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Representative 

Documents 




Association of Research Libraries 




Office of Leadership and 
Management Services 
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The University of British Columbia 
COLLECTION DEVELOPMENT POLICY: 
DATA SERVICES 



October 1994 

FACULTY: Data Services has a campus-wide mandate to serve data teaching and research requirements. In 
practice, and due to limited resources, we serve mainly the Faculties of Arts and Commerce. There are, however, 
regular as well as occasional users in many other faculties, particularly in Agricultural Sciences and Education, 
and in the Health Sciences. 

DEPARTMENT/SCHOOL: 

RELATED PROGRAMS/INSTITUTES/CENTRES: 

RESEARCH AND PUBLISHING CHARACTERISTICS: For Data Services purposes, machine-readable data 
files (MRDFs, data files, or data sets) are defined to include numeric, full text, image and similar data in digital 
form. Numeric files may contain micro- or macrodata, and may be rectangular (flat), time-series, cross-sectional 
or longitudinal files. Bibliographic or catalogue-type files are not included, nor is computer software of any kind. 
Data Services acquires primarily numeric files. 

Increasing numbers of MRDFs are being produced in many subject areas, in a variety of physical formats. At 
universities more and more quantitative analysis is being performed in business schools, in social science, science 
and medical faculties. Full text analysis is becoming increasingly popular in the humanities, and image data are 
used in many disciplines. 

MRDFs are frequently difficult to identify and locate since they are not usually distributed through the 
conventional commercial channels. They are often created at academic institutions for local research purposes or 
by government departments as policy planning tools, and are subsequently made available for sale to other 
interested researchers. 

Data files are distributed on media such as magnetic tape, personal computer disks, or CD-ROM disks. 
Distribution of files on the Internet is, however, growing rapidly and seems set to become the primary medium of 
dissemination. 

Data may be distributed in 'raw' format (no value-added component) or in 'packaged' format (complete with 
retrieval software). Raw data are most commonly distributed on magnetic tape or via the Internet and tend to be 
produced by universities and government departments, whereas 'packaged' data come on floppy or CD-ROM disk 
and are often marketed by the private sector. Data Services acquires raw data files only. Packaged files are 
acquired by the appropriate subject division in the Library. 

In general, Data Services acquires files which can be made available both via the campus network and to 
everyone at the University. CD-ROM and PC-based databases are thus generally excluded. 

Increasing numbers of numeric files are available for direct searching on the Internet. Through its gopher server 
(accessible through ViewUBC), Data Services maintains direct connections to a number of computing sites 
housing such files. 

Codebooks: Every data file requires a codebook for its interpretation. The codebook contains details about the 
structure and contents of the file and relevant background materials. It is impossible to use a raw data file without 
access to the codebook. For this reason two copies of all printed codebooks are acquired: one is anchored, the 
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other circulates. The anchored set of codebooks is housed in Data Services; circulating copies are soon to be 
moved into the Main stacks. 

In the case of computer-readable codebook files, Data Services generally produces one printed copy (anchored). 
The codebook file is made available for users to access online or print their own copies. 

CURRICULUM AND RESEARCH SUPPORT LEVEL: Canadian and U.S. socio-economic and financial 
time-series, international economic indicators, micro- and macrodata of relevance to Commerce, Economics, 
Political Science and Sociology. 

CURRICULUM SUPPORT LEVEL: see previous paragraph. 

SPECIAL EMPHASIS: All Canadian Census data, and all Canadian Institute of Public Opinion (CIPO) Gallup 
polls; other Canadian survey series such as the General Social Survey, the Labour Market Activity Surveys and 
Surveys of Family Expenditure. 

HIGHLY SELECTIVE: Other U.S. and international numeric data files; full text files; digital map data; satellite 
images, etc. Increasingly, full text and digital map files are acquired by other appropriate Library 
branches/divisions. 

Printed collection: Data Services has a small collection of books and serials in the area of survey design, data 
collection and analysis, and preservation of magnetic media. It also acquires the published catalogues of data 
archives and data publishers around the world although, increasingly, these are becoming available on the 
Internet. (See also 'Collections in other UBC libraries/affiliates' below). 

LANGUAGE COVERAGE: Primarily English (applicable to codebooks only). 

GEOGRAPHIC ORIGIN: All significant Canadian data files (especially microdata) for which there is 
demonstrated demand (Canadian being defined as 'having Canadian content'). U.S. and international data files 
acquired selectively, as funds permit. 

COLLECTIONS IN OTHER UBC LIBRARIES/AFFILIATES: CD-ROM products (with retrieval software) 
in the David Lam Library (financial time-series) and Government Publications (Canadian census and other 
government data). 

Large numbers of socio-economic time-series exist in printed as well as in machine-readable format, e.g. 

CANSIM (equivalent printed sources in Government Publications) and company balance sheet information 
(equivalent printed sources in the David Lam Library). 

Books on survey design, on data collection and analysis, and on statistical analysis software are also found in the 
Main Library, the Sedgewick undergraduate library, and the Math Library. Official reports and other materials 
related to government-produced data files are generally available in Government Publications. 

Most of Data Services' printed collection will be moved into the Main stacks, and collection development 
responsibilities have already been transferred to appropriate subject bibliographers. 

COLLECTIONS IN NON-UBC LIBRARIES: The nearest data libraries are at Simon Fraser University and at 
the University of Alberta. There are a handful of other Canadian data libraries, and large numbers of U.S. data 
libraries. Most data files have to be bought under licence, and data libraries can make such files available only to 
members of their own campus communities. We are thus effectively barred from accessing files elsewhere, unless 
we hold a licence for the same files or, as in the case of CANSIM, one university holds a distributor's licence and 
other universities purchase client licences for the same database. 

21 



o 




30 



University of British Columbia 



AREAS OF OVERLAP: Data available in different physical formats in David Lam and Government 
Publications. Identical data files available in other data libraries. Printed collections as specified under 
’Collections in other UBC libraries' above. 

INCONSISTENCIES THAT NEED TO BE RESOLVED: Data Services was founded as a separate branch 
library in 1972 to handle materials in a format different from traditional library materials, i.e. electronic, numeric 
data files on magnetic tape. The use of electronic data of all kinds has expanded rapidly as has network access to 
such data. More and more files are being made more easily accessible. The reasons for which a separate data 
library covering a wide range of subjects (but only for materials in a specific format, and with only two-and-a-half 
staff members) was founded are therefore no longer valid, and data library services are being mainstreamed. 
Services in this area will no longer be determined by format, but by subject, as indicated in various sections 
above. 

Another question that needs to be addressed relates to the fact that the same data file is frequently produced in a 
number of different formats. The CANSIM database, for example, can be accessed remotely through commercial 
vendors. It is also available as raw data on magnetic tape, as a CD-ROM database packaged with retrieval 
software and, of course, in a wide variety of printed sources. It should be established, probably on a case-by-case 
basis, whether and under what circumstances the same database should be acquired in different formats. 

There is considerable demand for biomedical data (such as genetic and protein sequences) on campus, but Data 
Services does not have the resources to accommodate it. The Woodward Biomedical Library has agreed to take 
responsibility for this area. 

ADDITIONAL NOTES AND COMMENTS: 

Data contracts: For most of the files acquired by Data Services a contract or licence agreement with the data file 
supplier has to be signed, which generally limits usage of the files to the UBC academic community. Data 
Services does not usually acquire data files that, under contractual agreements, can only be used by a single 
person. Any data file that we maintain must, at least, be accessible to all members of the UBC academic 
community. 

Consortia purchases: Data Services is a member of the CARL (Canadian Association of Research Libraries) 
purchasing consortium for Canadian Population and Agricultural Census macro- and microdata files, and for the 
Canadian General Social Survey series of microdata files. It is also a member of ACCOLEDS (A COPPUL 
Consortium of Library Electronic Data Services). The latter purchases an annual federated membership in the 
ICPSR (Inter-university Consortium for Political and Social Research). Both arrangements result in considerable 
cost savings for Data Services. Consortia agreements should be pursued wherever possible. 

Original data files created at UBC: Data Services has an archival function and will therefore collect original 
data files produced by local principal investigators in order to preserve them for posterity, provided that the 
principal investigator agrees to Data Services' standard conditions. 

Data Services will also accept, if offered, original data files from any source for deposit, provided that the 
principal investigator agrees to the Data Services' standard conditions. 

Resource sharing: Data Services is beginning to share networked databases with other universities, the first 
candidate being the CANSIM database from Statistics Canada. Resource sharing can result in the rationalization 
of costs and in increased access to data for our users. This is a major growth area with great potential for libraries. 

Submitted by: Hilde Colenbrander 

Date: October 13, 1994 
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ADDENDUM: 

Social science data: As noted earlier, Data Services maintains an annual membership in the ICPSR at the 
University of Michigan (through the ACCOLEDS consortium). It is also a member of the Roper Center for Public 
Opinion Research at the University of Connecticut. As far as possible, social science data files are acquired from 
the ICPSR and the Roper Center. 

The Roper Center membership secures all Canadian Institute of Public Opinion (CIPO) polls, and sometimes 
other public opinion poll data as well. 

Cost sharing: The annual ACCOLEDS ICPSR membership fee is shared equally between Data Services and the 
Department of Political Science. Membership entitles us to access to some 40,000 data files in the ICPSR archive 
which can be acquired on demand and at little additional cost. The data librarian is UBCs 'Official 
Representative' to the ICPSR. 

Financial and economic time-series data: Data Services maintains annual subscriptions to the following 
machine-readable time-series databases: 

• CANSIM Main Base-distributor's licence (Statistics Canada) 

#Citibase economic database (Citibank) 

ifi COMPUSTAT (Standard & Poor's, Inc.) files: 

'^1 Industrial annual 
^ Industrial quarterly 
y Canadian annual 
Bank annual 

ffiCRSP (Centre for Research in Security Prices) files: 

3NYSE/AMEX daily returns 
SNYSE/AMEX monthly master/retums 
^NASDAQ daily returns 
^Government bonds 

HlBES (Institutional Brokers Estimate System) 

1® IMF (International Monetary Fund) files: 

|j*i Balance of payments statistics 
Direction of trade statistics 
^Government finance statistics 
!®1 International financial statistics 

Cost sharing: The COMPUSTAT and CRSP databases are purchased jointly with the Faculty of Commerce. The 
Faculty paid for some of the initial database fees; annual update fees are shared equally between the Faculty 
(through the David Lam Library) and Data Services. 
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COLLECTION DEVELOPMENT STATEMENT 
FOR MACHINE READABLE DATA FILES 



INTRODUCTION 

This policy is intended to provide guidelines for the acquisition of machine readable data files. 
As the technology for the electronic storage of information changes, and as different kinds of 
information are made available in machine-readable formats, this policy will require revision. 
The current guidelines provide adequate physical and bibliographic access for a rather limited 
collection of specialized files. As the collection grows in scope and size, new guidelines may be 
required. 

DEFINITION OF MACHINE READABLE DATA FILES 

A machine readable data file (MRDF) is any information - numeric, textual, bibliographic, or 
some combination of these - stored in an electronic medium which is readable only by machine. 
A typical MRDF would consist of numeric data stored as electronically recorded signals on 
magnetic recording tape and readable by a computer, but other formats such as floppy disks, 
hard disks, video disks, and compact disks may become more common. 

GOALS 

The goals of this machine-readable data file policy are to a) affirm that the Library will acquire 
information in the format or formats which are most useful to the UCSD community, b) provide 
criteria for selection to MRDFs including those criteria which are specific only to MRDFs, and c) 
outline the role of the Library in providing access to MRDFs. 

SCOPE 

This policy covers data in any machine-readable format. It covers any type of library acquisition 
including purchase, lease, gift, or other means of acquisition which add material to the library 
collection. 

POLICY 

Machine readable data files acquired by the Library must meet the same criteria that other 
formats must meet to be added to the collection, i.e., the data or information must support an 
identifiable current or future research or curriculum need on campus. The extent of this need 
must justify the expenses of acquiring, processing, and maintaining the data file. In addition, 
acquisition of MRDFs by the Library should involve the following considerations: 

1. Codebooks. There must be a complete and accurate codebook or user guide which 
details data structure and format, defines each data element, and explains all codes used. 

2. Other Documentation. Other documentation, such as a description of how the data 
were collected, may be required for some data sets in order for the data user to evaluate 
and use the data. 

3. Physical Format. The machine-readable format must be compatible with machines 
available to the UCSD community. 

4. Software. Before acquiring a machine-readable data file, the Library must be assured 
that the UCSD community will have software access to the data file. While the Library 
does not, at this time, have the staff to create software, software access can be assured in 
any of several ways: 

a. The library can acquire, through purchase or other means, available 
software. 

b. The data file may be accessible by software already available to the UCSD 
community. 

c. Programming for accessing the data file may be made available (for example, 
by an academic department). 
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5. Duplication of Data. As with other formats, the content of the data should be 
evaluated in terms of how much it duplicates data already available in the collection. It 
may be appropriate, however, to make data available in more than one format (e.g., in 
printed format and in machine readable format). Each case should be evaluated 
individually. 

6. Authority of Data. As with other formats, the authority and completeness of the data 
should be considered. 

7. Online Availability. Many MRDFs are available online. Before purchasing a file 
which is available online through vendors or search services, the costs and uses of on- 
campus availability versus online availability should be considered. 

8. Accessibility. Some data files may have restrictions on their use. Examples of 
restrictions include: data may be limited to ’academic use” only; data may not be copied; 
data may contain proprietary or confidential information. Data files which have 
restrictions with which the Library cannot comply will not be acquired. 

PROCEDURE 

The Library will acquire only those data files which meet the policy criteria stated above and this 
will normally occur only after a member of the faculty requests a specific file. The Library will 
evaluate the acquisition using the criteria listed above. The evaluation process will include the 
appropriate subject bibliographers, the MRDF bibliographer, the AUL for collection 
development, and the faculty in appropriate departments. 

The Library will acquire codebooks, user manuals, and other documentation along with any 
datafiles and will catalog and maintain these materials. 

THE ROLE OF THE LIBRARY 

The Library has several roles in assuring campus access to machine readable data files. 

1. The Library will help identify the existence of data files not in the Library collection. 
The Library will continue to take an active role in keeping its users informed of the 
availability of information pertinent to their needs and will make a particular effort to 
identify machine readable data. 

2. The Library will acquire the data files, and will insure that the files meet the 
standards set forth in the selection criteria listed above. When data files are available 
from more than one source, the Library will determine the best and most reliable source. 

3. As a federal government depository library, the Library has a particular commitment 
to making government-produced information available. Because the federal government 
does not, at this time, include machine readable formats in the depository system, the 
Library will take an active role in attempting to assure access to such government- 
produced machine readable information. 

4. The Library will maintain memberships in organizations such as the Inter-University 
Consortium for Political and Social Research and the International Survey Library 
Association in order to provide reliable and easy access to datafiles. The funding sources 
for such memberships have not been established at this time. 

5. The Library will acquire, catalog, and maintain codebooks and other user guides, 
making them readily available to users of the Library collections. 

6. The Library will provide reference service for the printed materials and referral 
service to the machine readable data files as part of the routine of providing information 
services. 

7. The Library may produce catalogs, lists, or other materials to aid in identifying 
machine readable data files in the Library collection. 
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NUMERIC DATA SERVICES 

SOCIAL SCIENCES PROGRAM 
of iht Htirvvrd Golltgr. I.fbritry 

Collection Development Policy for Social Science Data 
Sets 

GOALS OF THE POLICY 

• Define roles of SSP Area Specialists and the Social Science Data Librarian with 
respect to the acquisition of machine-readable data files; 

• Communicate to our users the general criteria we use in making purchasing 
decisions for data; 

• Act as a planning document to define the roles of the SSP and the Harvard-MIT 
Data Center (HMDC) on data acquisitions issues; and 

• Encourage collaboration and co-operation among SSP and other Harvard units to 
provide effective and cost-effective data collections at Harvard. 

SCOPE 

This policy covers machine-readable numeric data and accompanying documentation 
for use in social science research and coursework at Harvard University. It does not 
include data acquired through the Harvard Map Collection or the Environment 
Program of the SSP. Machine-readable data come in a variety of formats such as 
magnetic tape, diskette, CD-ROM and through online services. The time coverage 
includes both historic and current numeric data sets. 

The acquisition of numeric data in print format follows the same guidelines as applied 
to other print materials acquired by the SSP. 

USERS 

The collection of machine-readable numeric data files supports research and teaching 
in the social sciences at Harvard. Data users come from a wide range of Departments 
and Schools at Harvard. 

Historically, the primary data users were faculty members and graduate students, but 
increasingly there is demand for numeric data by undergraduate students. Changes in 
technology have simplified access to numeric data and created an even larger 
audience to serve. Many undergraduates now are expected to use "real -life" data in 
their upper-year coursework (ex. statistics or econometrics) and in their theses. 
Harvard undergraduate students are expected to demonstrate proficiency in data 
interpretation in their first-year by meeting the Quantitative Reasoning Requirement. 

The general public has access to Harvard's government documents collection, 
including many sources of numeric data in print, diskette, CD and online subscription 
format. 

EXISTING COLLECTIONS 




About Numeric) 
Data Serviceb 

Numeric Data 
Services online 

Numeric Data 
Services in person 

What's New at Numeric 
Data Services 

The Social Sciences 
Program 
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Harvard is a member of the United States Federal Depository Library Program, as 
well as the Canadian and United Nations depository programs. Some numeric data 
files are received free as part of the depository programs; however, many government 
machine-readable numeric files are outside the scope of the depository collection and 
require separate purchases. Harvard is a European Union depository library through 
the Law School Library. 



As of August 1998, the Government Documents unit and Littauer Library have a 
combined collection of approximately 550 CD titles (amounting to over 1900 
individual CD's) and approximately 55 diskette titles (250 individual diskettes). The 
collection of numeric data sets also includes subscriptions to online services such as 
STAT-USA, and Statistical Universe on Hollis Plus. 

I 

! COLLECTION GUIDELINES 

As part of our mission, we strive to have a proactive collection development process 
where selection is guided by the academic programs at Harvard and the anticipated 
demand for data files from the Harvard community. The collection is developed in 
conjunction with area specialists in the SSP, the Harvard-MIT Data Center and other 

I interested units at Harvard. 



General guidelines for data acquisition include the following: [adapted from ICPSR 
Management of Machine-Readable Social Science Information Workshop, August 
1996] 

i 

• relevance of the data to research and teaching at Harvard; 

• current and potential user demand; 

• format of the data; (do we have the necessary hardware and software to support the 
data file) 

• one-time purchase or are updates required; 

• restrictions on the use of the data; 

• adequate documentation; 

• quality of the data; 

• quality of the search software; 

• data producer reliability; 

• required technical support; 

• required reference service; 

• data distribution (stand-alone or networked). 

i 

' Access 

I 

Many machine-readable numeric data files in the SSP collection are distributed in 
CD-ROM or diskette format. Many of these files are considered reference materials 
and need to be available in the library at all times. All CD-ROMs and diskettes are 
catalogued and many are available for use on the workstations in Littauer Library and 
j the Government Documents Reading Room. 

! The collection of CD-ROMs is divided into 3 categories: 



Level "1 " Service 

- CD's will be networked for use in Littauer Library and/or the Government 
Documents Reading Room; 

- Reference Staff will be trained on use of these CD's (content, searching, 
downloading); 

- In-house guides will be provide^f^* users. 
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Level "2 " Service: 

- Local access; software will be loaded on Littauer or Government Documents 
workstations and user will ask for CD's at the circulation/reference desk; no 
circulation of CD's allowed; 

- All user documentation will be in the public area; 

- At least one staff member will be familiar with CD's general use and will be 
available for user 

consultation. 

Level "3 " Service: 

- No local access; 

- CD's and documentation circulate; 

- No user support; library staff not expected to provide reference. 

Level 3 CD's will be barcoded and allowed to circulate to current Harvard ID holders 
for 48 hours. Users are responsible for returning CD's on time and in good condition. 
Users who do not comply with these rules will be billed for the replacement of the 
CD's. 

All diskettes will be catalogued separately, barcoded, and kept at the Littauer Library 
Reserves Desk or the Government Documents Reference Desk, according to subject 
lines. The back-up copy of a diskette will circulate for a period of 48 hours. Users 
who do not comply with these rules will be billed for the replacement of the diskettes. 

Documentation 

Littauer Library houses the HMDC's collection of ICPSR codebooks. Many ICPSR 
codebooks are available online in pdf and HTML format. The SSP will catalog and 
maintain a collection of print ICPSR codebooks, non-ICPSR codebooks, and other 
supporting materials for data files in the collection. 

User Requests for Data Purchases 

Suggestions for data purchases will be considered on an individual basis. The 
guidelines listed above (under Collection Guidelines) will be used when making 
acquisition decisions. 

Back-up Policy 

The staff of the Social Sciences Data Service will compile an on-going list of 
machine-readable numeric data files which require back-up. Criteria for back-up 
include importance of the data to users and to the collection, availability of alternative 
sources, cost of original purchase, original delivery format, etc. Files will be backed 
up on CD-ROM and stored in the Harvard Depository. 

The back-up policy is not intended to be an archival policy. At this time there are no 
plans to function as a data archive or to engage in long-term data preservation 
activities. 

OTHER AVAILABLE RESOURCES AT HARVARD 

The SSP will work with other Harvard units to discuss data acquisitions and service 
issues to maximize co-operative data collections and access for the Harvard 
community and minimize duplicate costs and efforts. 

34 



37 



University of Pennsylvania 



http://www.library.upenn.edu/services/collections/policies/diginfot.html 

^UNIVERSITY OF PENNSYLVANIA LIBRARY 



Collection Development Policy: Digital Information Resources 



Introduction. 

The Penn Library was an early adopter of information in electronic form, beginning in 1973 when librarians mediated 
batch processed searching of remote databases on behalf of patrons. Since that time the number of digitized resources 
has increased dramatically; full-text and numeric products have appeared, and time-share based services have been 
followed by CD-ROMs, floppy disks, magnetic tapes and other media for local loading, and access through remote 
servers via the world-wide web. Digital resources have proven to be essential to the support of scholarship, and the 
Library provides a selection of them to the University in accordance with programmatic needs, anticipated use and 
available resources. The purpose of this policy is to establish consistency within the Library in managing this 
increasingly important part of the collections. 

Scope of this Policy. 

This policy covers networked academic digital information resources purchased or leased by the Library on behalf of 
Penn faculty, students and staff in all schools and libraries except Law (although members of the Law School have 
access to our digitized information). These resources may be locally loaded or accessed remotely. All are available 
through the Penn Library web page to authorized users. 

This policy does not govern the acquisition of non-networked products, resources networked within a building, 
instructional software, applications software or software intended for internal library and/or administrative use. 

The Schoenberg Center for Electronic Text & Image (SCETI) creates digitized images of some of Penn's rare and/or 
unique resources. See the SCETI homepage for details. For digitized visual resources in support of art and 
architecture, see the policy for Visual Resources currently under preparation by William Keller, the Fine Arts 
Librarian. 

General Policies. 

Selection of Networked Resources. 

The Library purchases or leases high quality digital resources of major value to Penn programs and user groups in 
accordance with subject collection development policies, and makes available initially a sufficient quantity of data to 
be useful. The Director for Collection Management and Development facilitates the selection process and consults 
with other directors, departmental librarians, bibliographers and liaison librarians as appropriate. As with selection in 
other formats, the Library welcomes suggestions from faculty, students and library staff members. 

This is a transition period in the provision of academic information, and the Library's practice varies depending on the 
type of resource. Reference and indexing and abstracting tools have largely moved to digital formats, and the Library 
no longer receives the print equivalents. However, the Library does not expect to cancel print journals even when 
digitized versions are available until archiving and long-term access are assured. As a result some information is 
provided at present in both digital and print forms. This is not a practice the Library can maintain over time. As funds 
permit the Library buys access to some full-text resources deemed to have long-term value in support of Penn's 
academic programs. When access to a particular resource is not peimanent because of product changes, cost or 
technical reasons, the Library hopes to provide an alternative. The Library may also move a resource from one 
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platform to another when the new interface is superior to the old. In response to faculty and student requests the 
Library is now acquiring more full-text resources. 

The Library acquires some digital resources through package deals with major suppliers such as the Research 
Libraries Group; others are selected individually. 

Digital resources require continuing management to a far greater degree than print resources do. In general it is the 
responsibility of the sponsoring librarian to keep up with changes in an interface, selecting an alternate vendor and 
alerting staff and users to problems. Responsibility for writing documentation is shared by collection development and 
public service librarians. 

Selection Criteria. 

Selection of a digital resource requires the careful weighing of a number of content, presentation, functionality, 
technical, cost and management factors. Among these are the following: 

Content: 

° The importance of the resource for its discipline and the level of use expected at Penn. 

° The comprehensiveness, durability and accuracy of the database. 

° The currency of the information and update schedules. 

o The distinctiveness of the database in the Penn environment, the extent of overlap (if any) with other 
Penn databases, and the relationship of the database to the Library's print and other holdings. 

° The size of the database. 

Presentation: 

° Design of the interface. 

° Willingness of the vendor to make changes for Penn's implementation. 

° Platform(s) available. 

Functionality: 

° Functionality, including link to the online catalog, integrated ILL request capability, output formats. 

° The potential of the database for secondary use, and the availability of data apart from that embedded in 
the interface. 

Technical: 

° The Library's digital resources must conform to current university-wide desktop computing standards. 

Obsolete formats and platforms are not generally supported. 

° Ease of installation and maintenance, if required. In all cases the Library must have the technical 
capacity and staff needed to make available and maintain a database proposed for purchase. The 
Director for Information Systems makes this determinatio n. 

° The Library strongly prefers web versions of digital resources and will only acquire a Window's version 
in cases where there is no alternative. 

Cost: 

° The cost of the database. 

° The availability and cost of computer storage space if needed. 

Management: 

° Availability and content of usage reports. 
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Funding. 

The funding pattern for digital information resources has changed over the years. Initially most were paid for out of 
central library funds under the management of the Director for Administrative Services. Now most resources (with the 
exception of Citadel files) are paid for out of the Information Budget. The Information Processing Center is 
responsible for invoice payment. 

Proposals to purchase or lease a database should be directed by the sponsoring librarian to the Director of Collection 
Management and accompanied by a realistic funding package which takes into consideration all the costs of a 
particular product, including internet access fees, server space and local staff time. The cost of a new database is 

initially borne by the appropriate subject fund or funds, and bibliographers are encouraged to collaborate and pool 
their resources. In subsequent years payment may come from the Digital Resources Fund at the discretion of the 
sponsoring librarian and if sufficient money is moved from the subject allocations to the Digital Resources Fund. 

Databases which the Library purchases on a onetime basis are paid for out of the appropriate subject fund. 

The Library participates in consortial purchases or leases of databases when it is financially advantageous to do so. 

Licensing. 

The Director of Collection Management and the appropriate bibliographer(s) review the license agreements for new 
databases; the Director signs the agreements. 

The Library subscribes to the July 15, 1997, "Principles for Licensing Electronic Resources," drawn up by a group of 
library organizations, including the American Library Association and the Association of Research Libraries; and to 
the March 25, 1998, "Statement of Current Perspective and Preferred Practices for the Selection and Purchase of 
Electronic Information," put forth by the International Coalition of Library Consortia (ICOLC). 

In reviewing licensing agreements the Library seeks to protect its rights to information to the fullest extent allowable 
under law and includes the following statement in all new license agreements it negotiates: 

"The Board of Trustees of the University of Pennsylvania reserves the right to use the licensed (or purchased) 
work to the fullest extent of applicable laws, including the right of fair use under U. S. copyright law, including 
but not limited to the right to reproduce (or have reproduced) copies for its nonprofit, educational purposes, 
such as for comment, criticism, news reporting, teaching (including multiple copies for classroom use), 
scholarship and research. This term is a material part of the contract between the parties and supersedes any 
contract terms contrary to or inconsistent with such rights under law." 

In addition, the Library pays particular attention to its rights to continuing access to information paid for; the 
archiving of information; restrictions, if any, on the use of information for sponsored research; and confidentiality 
clauses. 

At present the Director of Collection Management holds the license agreements for negotiated databases; the 
Acquisitions Department holds the license agreements for electronic journals. The goal is to consolidate all license 
agreements in a single location. 
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Electronic Media 

Management of Electronic Access Databases Group (MEAD) 
bb546@csc.albany.edu 

I. General Purpose 

The University Libraries select, house, and provide access to information published in many formats. In 
recent years traditional formats for information, such as books and microfilm, have increasingly been 
supplemented by information which is accessible only through the use of a computer. This policy 
addresses collection parameters for electronic media which the University Libraries expect to collect. 
Further, the University Libraries staff are aware that however flexible this policy, the speed with which 
technological changes are occurring with respect to information access will necessitate frequent revision 
of this document. 

Selection of materials within the scope of this policy is the responsibility of various individuals and 
groups. Subject Bibliographers select all electronic media which fall within their regular subject 
assignments. Interactive media packages are selected by the Bibliographer in consultation with the 
Interactive Media Center (IMC) Coordinator. Bibliographers request technical assistance from the IMC 
Coordinator in determining the compatibility and advisability of electronic media purchases. They may 
submit to the IMC Coordinator purchase requests for interactive media. The Management of Electronic 
Access Databases Group (MEAD) approves purchase of all CD-ROM subscriptions and subscriptions 
to other electronic resources to support reference service upon the recommendation of University 
Libraries Reference librarians. The MEAD Group also addresses networking of CD-ROM products and 
purchases of products accessed via the Internet, as well as the library’s approach to new electronic 
formats. The Government Publications librarian selects electronic media for the documents collection 
and librarians in other divisions of the libraries select electronic media needed by that division to 
support their work. 

II. Subject and Language Modifiers 

There are no exclusions based on language, chronological period or geographical area, but materials 
should reflect the present and expected future curricular and research needs of the University. 

III. Description of Materials Collected 

The University Libraries view acquisition of electronic media as a natural expansion of existing 
collection development activities. Materials are selected in all formats for which the University Libraries 
own or expect to own in the near future the equipment for their use; exceptions may be made when an 
academic department owns a specific piece of equipment and purchases are confined to materials 
requested by that department. 

Electronic media can be categorized as follows: 

1 . Bibliographic files, e.g., MLA or ERIC ; 

2. Textual files, e.g., Nuremberg War Crimes Trials Online ; 

3. Numeric data files, e.g., Compustat ; 

4. Graphic and multimedia files, e.g., National Gallery of Art Laserguide ; 
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5. Courseware/instructional files, e.g., Hvpercell ; 

6. Internet files that fall under categories 1-5; and 

i ' 

7. Specific applications software needed to utilize the resources listed above, e.g., 

Excel . i 

8. Other types of electronic resources will be considered as they are developed, in 
light of their relevance, appropriateness, and contribution to the University 
Libraries' mission. 

Electronic resources may be organized, collected, and distributed in a variety of formats. These include, 
but are not limited to, magnetic disks, magnetic tape, and optical disks. Delivery systems include PCS, 
LANs, and remote servers. 

General Selection Criteria 

Electronic media considered for acquisition by selectors should: 

• follow all current collecting guidelines as represented by the subject collection development 
statements and other appropriate documents; 

• represent materials useful and important to a significant segment of the Library's user 
community, and reflect current curricular and research needs; 

• be available in formats for which the University or the University Libraries own appropriate 
hardware; 

• be sufficiently "user-friendly" in that it provides or permits design of such amenities as 
introductory screens, on-screen tutorials, prompts and menus, function-specific help, novice and 
expert searching levels, helpful error messages, ease of exiting from one point in the database; 

• be evaluated in light of other potential acquisitions, and weighed against acquisition priorities for 
other formats; 

• provide improved access to or be an enhancement or enrichment of current library collections; 

• reflect the excellence, comprehensiveness, and authoritativeness expected of materials in other 
formats; 

• have adequate documentation available; 

• not require an excessive amount of staff time to provide adequate use of the media — individual 
cases may be negotiated with faculty; 

• avoid duplication of print or microform holdings unless the electronic resource is not archived 
satisfactorily (does not provide for permanent retention) or provides better access to the 
information, or when different formats meet different needs for different user groups; 

• be broadly accessible under current copyright laws and licensing agreements. 




42 



39 



University at Albany, SUNY 



V. Other Factors 

Electronic media is housed in various areas of the University Libraries. However, the primary location 
for interactive media, video and audio formats is the Interactive Media Center. The Reference and 
Government Publications areas of both the University Library and the Dewey Library house or provide 
access to electronic media used to assist patrons with reference queries. The Technical Services and 
Collection Development Divisions maintain electronic media needed by them for acquisitions and 
processing of library materials. 

November 1997 

fReturn to Table of Contents! 
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If; UBC Data Services 

A (very) brief overview 

I; What are data files anyway? 

If The codebook 

Micro- and macrodata 
View a data file 

What is the central UNIX service? 

Getting UNIX access 
Data files available on UNIX 
Getting help with UNIX 

What data services are available? 



What data files are avilable? 

Data Services' sources 

How do I get the numbers I need? 

The steps to extracting data 

- studying the codebook 

- identifying variables 

- which statistics package 

- finding file names 

- signing on to UNIX 

- setting up an extraction job 

- collecting your data 

What about data not at UBC? 

: . Statistics Canada data 
ICPSR data •; 

The Roper Center : 

What do I need to use the data 
files? 



Need helo with statistical 




Numeric Data Services at UBC: An Introductory 
Guide 



Data Services has provided numeric data services to the 
UBC campus since 1972. In this Guide we describe our 
services and collections, using detailed examples. If you've 
never used data files before, we hope the Guide will give 
you a better understanding of what data files are: and how 
you can use them. 

Our primary user groups, and hence our data file holdings, 
are in the social sciences and in commerce and business 
administration but some of the more general files, such as 
CANSIM or the Census of Canada, are of interest to 
researchers from many other parts of campus as well. 

We hope you find the Guide interesting and informative. Do 
contact us at Data Services if you have any comments or 
questions. 
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Statistical Sources 



Electronic Sources 


Print Sources 


Library of Congress Call 






Number Sequences 



ELECTRONIC SOURCES 

• Academic Universe (CSU affiliates only) 

Follow path "Reference7"Country Profiles" for handy statistics on individual countries. 

• Census Bureau 

Includes 2000 census and much other data. 

• Centers for Disease Control 

Includes data from the National Center for Health Statistics. 

• Central Intelligence Agency (CIA) World Fact Book 
Brief up-to-date statistics on individual nations. 

• Colorado bv the Numbers 

Statistics pertaining to the state of Colorado. Begin with the "Table of Contents," 
which is arranged by subject. 

• Colorado Home Pane 

Has links to state agencies, which sometimes mount statistics on their sites. 

• Colorado State University Office of Budgets and Institutional Analysis 
Includes the "Fact Book" and other institutional data in PDF format. 

• County Information Service 

Many links to statistics pertinent to Colorado counties. 

• FactSearch (CSU affiliates only) 

"Snapshot" statistics on policy issues, drawn from magazines, newspapers, and 
government publications. 

• Fed Stats 

Links to 70 federal government agencies, including the Census Bureau, Office of 
Management and Budget, and agencies dealing with economic analysis, labor, justice, 
transportation, energy, the environment, agriculture, and education. 

• FisOnline (CSU affiliates only) 

Click on "Country Profiles" for extensive statistics on individual countries. Very 
similar to the CIA World Factbook. 

• Food and Agriculture Organization of the United Nations 

Includes FAO databases dealing with agriculture, nutrition, fisheries, forestry, and food 
quality control. 

• Larimer County Compass 

Statistical data for Larimer County and Fort Collins. 

• NASI RE State Search 

Provides links to agencies in all 50 states; arranged into 32 topical categories. 

Provision of statistics varies widely from state to state and agency to agency. 

• Oregon State University Government Information Sharing Project 
Demographic, economic, and education data drawn from Census Bureau files. 
Especially useful for state, county, and school district data. 

• Regional Economic Information System 

Data for regions, states, counties, and metropolitan areas for 1969-1998. Compiled by 
the U.S. Bureau of Economic Analysis. 
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• STAT-USA (CSU affiliates only) 

Provides access to the National Trade Data Bank, including statistics, market research, 
and country analysis. 

• Statistical Resources on the Web 

This University of Michigan site provides numerous links to foreign government 
statistical agencies, foreign trade, and foreign economics. 

• Statistical Universe (CSU affiliates only) 

Abstracts and some full text of statistics compiled by federal agencies, often 
reproduced from print sources. Links to other statistical sites. 

• United Nations Web Site Locator 

Links to approximately 90 UN organizations appear in the "Alphabetic Index;" links to 
another 30 agencies can be found in "Other International Organizations." Provision of 
statistics varies by agency. 

• U.S. Census Bureau International Data Bank 
Demographic statistics arranged by topic and country. 



PRINT SOURCES 

Print Sources: Federal and State Agencies 

• Agricultural Statistics A 1.47: [year] Doc Ref 
Annual. 

• CIA World Factbook. PREX3:15 Doc Ref 
Annual. 

• Colorado Vital Statistics. HA271 .A3 Doc Ref 
Annual. 

• Crime in the U.S. [Uniform Crime Reports]. J 1.1 4/7 Doc Ref 
Annual. 

• Digest of Education Statistics. ED 1 .326: [year] Doc Ref 
annual. 

• Economic Report of the President. PR39.9:[year] Doc Ref 
Annual. 

• Public Land Statistics. 153.1/2: [year] Doc Ref 
Annual. 

• Statistical Abstract of the U.S. C3: 134: [year] Doc Ref 
Annual compilation covering many topics. A useful starting point. 

• Survey of Current Business. C59. 1 1 : Doc Ref 
Published monthly. 

• Transportation Statistics. TD 12.1: [year] Doc Ref 
Annual. 

• U. S. Industry and Trade Outlook. C6 1 :48 : [here] Doc Ref 

Print Sources: International Agencies 

• Human Development Report. HD4904.7 .H856 Ref 
Annual statistics on social indicators by country. 

• International Labor Organization. Yearbook of Labour Statistics. HD4826 .163 Ref 
Annual. 

• International Monetary Fund. Balance of Payments Statistics Yearbook. HF 1014 .153 
Ref 

Annual data by country and region. 

• International Monetary Fund. Direction of Trade Statistics Yearbook HF91 .U473 
Ref 
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Annual data on exports and imports. 

• International Monetary Fund. International Financial Statistics. HG61 .157 Ref 
Four pages of annual data for each country. 

• International Monetary Fund. International Trade Statistics Yearbook. HF91 .U473 
Ref 

Detailed annual data on exports/imports by commodity. 

• UNESCO Statistical Yearbook. HA 12.5 .U63 Ref 
Annual data on education, science, and culture. 

• United Nations Demographic Yearbook. HA 1 7 .D45 Ref 
Includes both population and vital statistics. 

• United Nations Statistical Yearbook. HA 12.5 .U63 Ref 
Includes population, economy, and economic activities. 

• World Bank. World Development Indicators. HC59.1 5 .W656 Ref 
Annual data on demographics, the environment, the economy, and markets. 

• World Bank. World Development Report. HC59.7 .W659 Ref 
Annual data on developing countries. 



Print Sources: Commercial Publishers 

• ACCRA Cost of Living Index. HD6983 .A4 Ref 
Quarterly data for 300 U.S. urban areas. 

• Book of the States. JK2403 .B6 Ref 
Annual comparative data in many categories. 

• Business Statistics of the United States. HC101 .B97 1999 Ref 

• City Crime Rankings. HV6787 .C58 Ref 
Updated periodically. 

• County and City Extra. HA203 .C68 1998 Ref 
Statistics on counties and cities. 

• CQ's State Fact Finder. HA203 .C85 Ref 
Useful annual comparative data. 

• CRB Commodity Yearbook. HC14 .C6 Ref 
Annual. 

• Crime State Rankings. HV6787 .C733 Ref 
Annual. 

• Education Statistics of the United States. LA201 .E38 1999 Ref 

• Handbook of US. Labor Statistics. HD8064 .H38 1999 Ref 

• Illustrated Book of World Rankings. HA 155 .K87 1997 Ref 
Comparative data among nations. 

• National Safety Council. Injury Facts. HA2 1 7 .A4 Ref 
Annual. Formerly called Accident Facts. 

• NEA Almanac of Higher Education. LA227.3 .N37 Ref 

• New York Stock Exchange Fact Book. HG457 1 .N6 Ref 
Annual. 

• Source Book of ZIP Code Demographics. HA203 .S66 Ref 
Best source for data by ZIP code. 

• Standard and Poor's Industry Serveys. HC 1 06.6 .S74 Ref 
Quarterly data on industries and leading companies. 

• Statistical Abstract of Latin America. HA935 .S8 Ref 
Annual. 

• World Population Data Sheet. HB848 .P65a Ref 
Annual. 



45 

o 

ERIC 



50 



Colorado State University 



Print Sources: Historical Statistics 

• Datapedia Of the United States, 1790 ~ 2000. HA202 .K87 1994 Ref 

• Historical Statistics of The States of the United States. HA2 1 4 .D63 1 993 Ref 
Some data on cities also. 

• Historical Statistics of the United States. C3 134/2 .H62 Doc Ref 

• International Historical Statistics: Africa, Asia and Oceania. HA4675 .M55 1998 
Ref 

• International Historical Statistics: Europe, 1750 — 1993. HA1 107 .M5 1998 Ref 

• International Historical Statistics: the Americas, 1750— 1993. HA 175 .M55 1998 
Ref 

• Value of a Dollar, 1860 - 1999. HB235 .U6V24 1999 Ref 
Historical prices in the United States. 

• World Population History. HB851 .M32 1978b Ref 



Print Sources: Bibliographies and Guidebooks 

• American Statistics Index. Z554 .U5 .A46 1998 Doc Ref 
Indexes and describes publications of the federal government. 

• Encyclopedia of Business Information Sources. HF535 1 .E53 Ref 

Updated periodically; useful for sources for industry and trade associations data. 

• Index to International Statistics. Z755 1 .163 Ref 

Indexes and describes publications of international organizations. 

• Major US. Statistical Series. HA37 .U55S77 1992 Doc Ref 
Listings and definitions of federal statistical series. 

• Statistics Sources. Z7551 .S84 19th 1996 Ref 

A listing by subject. Especially useful for sources for country data. 

• Understanding the Census. HA201 1990 .L3 8 1996 Doc Ref 
Excellent explanation of the U.S. Census and its publications. 



LIBRARY OF CONGRESS CALL NUMBER SEQUENCES: 

• International statistical compilations: HA12.5-HA42 

• General statistical compilations: HA 155 

• United States statistical compilations: HA 175— HA730 

• Statistical compilations for other continents and countries: HA740-HA4737 




Resources by Subject 
Content: Doug Ernest 
Last updated: 07/31/01 

URL: http://manta.library.colostate.edu/research/statsrc.html 

A-Z Index 

Colorado State University Libraries ^ Colorado State University 
Library Catalog ( SAGE) ^ Databases ^ Interlibrarv Loan ^ Search the Web <> Help 

Disclaimer & Copyright Statement 
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Duke University 



http://docs.lib.duke.edu/data/ 



Duke University 



Perkins Library Home Catalog Databases E-Re*orves Search 




Public Documents & Maps 

State f Mops & GlS T Palp I" International 



Hours 

Staff Data Services offers support in locating, extracting and formatting data from 

Contact different sources in a variety of formats. Statistical software packages and data 

translation software are available on two separate workstations. 



Census 2000 
ICPSR at Duke 
D ata Re sources 



Duke Only! 



Duke Only! 



Duke On M 



Current Index to Stats 
Statistical Univeige 
Source OECD 
S t at-USA 



OECD Statistical Compendium 
Int'l Financial Statistics (IFS) 
World Development Indicators [pdf] 
Citation S t yle s foj-Dala 
Citing Electronic Resources 

Research Guides 

He lfi £ £Q jar 



Electronic Statistics Textbook 

Hy p e r S te l 

Statistics Glossary 



ICPSR at Duke 

Social Sciences Computing 

Statistical Consulting Center 



All Class Resource Guides 



United States 

Census 



In December 1975, Congress passed Public Law 94-171 
which specifies that within one year of Census Day, the 
Census Bureau must send each state the data that could 
be used by the state to redraw federal, state and local 
legislative districts. Raw Census 2000 redistricting data 
are now available in directories for individual states on 
the Census Bureau FTP site. To facilitate exporting into spreadsheet or database 
software, the files are segmented so that individual files will not have more than 
255 fields. Each state directory thus has three data files, reflecting three data 
segments (Geographic Header file, FileOI (Tables 1 and 2), File02 (Tables 3 and 
4)). To get the complete data set for the redistricting files, users must FTP all 
three files in the state directory. Chapter 7 of the Technical Documentation 
[PDF] provides information on how to link these files. The files are provided in 
compressed (ZIP) format and need to be uncompressed before they can be used. 
Once uncompressed, the data are in a flat ASCII format. The geographic file is 
in a fixed-field format; the two data files are in comma delimited format. Files to 
facilitate manipulation in SAS, xBase, Access 97 and Access 2000 are also 
provided. Consult the File Structure Read Me file for more information. 



The Office of Management and Budget (OMB) has completed its Metropolitan 
Area Standards Review Project (MASRPE This review, which began early in the 
1990s and continued through the fourth quarter of 2000, has culminated in the 
publication of new standards for the coming decade. The Metropolitan Area 
program has provided standard statistical area definitions for 50 years. In the 
1940s, it became clear that the value of metropolitan data produced by Federal 
agencies would be greatly enhanced if agencies used a single set of geographic 
definitions for the Nation's largest centers of population and activity. Prior to that 
time, Federal agencies defined a variety of statistical geographic areas at the 
metropolitan level (including "metropolitan districts," "industrial areas," "labor 
market areas," and "metropolitan counties") using different criteria applied to 
different geographic units. Because of variations in methodologies and the 
resulting inconsistencies in area definitions, one agency's statistics were not 
directly comparable with another agency's statistics for any given area. OMB’s 
predecessor, the Bureau of the Budget, led the effort to develop what were then 
called "Standard Metropolitan Areas" in time for their use in the 1950 census 
reports. Since then, comparable data products for Metropolitan Areas have been 
available. 



The Statistical Consulting Center (SCC) in the Institute of Statistics and 
Decision Sciences is a statistical consulting service available at no cost to 
members of the School of the Environment and the School of Arts and 
Sciences. The SCC provides statistical advice and assistance with short-term 
statistical analysis to students, researchers and faculty. This includes assistance 
with designing experiments, studies and surveys, as well as analyzing data. The 
SCC seeks to develop longer term, collaborative research efforts and promote 
cross-disciplinary interaction between ISDS faculty and students and their 
counterparts in other disciplines. 



Sea rch Public Documen t s & Maps Web 

Search I 



Duke | Perkins Library | Public Documents & Maos | Catalog | Databases | E-Reserves | Search 



Location: http://docs.lib.duke.edu/data/index.htm 
Last update: 09/20/01 



Christof Galli £ 

Public Documents & Maps Department 



Perkins Library - Duke University 
Durham, NC 27708-0177 
U.S.A. 
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University of Massachusetts 



http:/ 7www.library.umass.edu/ govdocs/ cdguides/index.shtml 



Government CD-ROM user guides 

The Government Documents Collection at the University of Massachusetts Amherst has a large collection 
of CD-ROMs, produced by federal government agencies, containing statistical reports, research reports, 
and compilations of laws and regulations. 

CD-ROMs are included in the library's online catalog and are kept in the Documents Office on Floor 6. 

Most, but not all, federal government CD-ROMs include software for easy access to the information on the 
CD. 

In addition to the user guides listed here, technical documentation for CDs is either in paper or on the CD. 

CDs can be checked out, with some exceptions. They can also be used in the Government Documents 
Office; call ahead for availability (413-545-2765). 

Government CD-ROM User Guides 

Agricultural Statistics 1998 
Alabama Coastal Hazards Assessment 
American Community Survey 

Appropriations. Budget Estimates. Etc. [104th Congress, 1st session] 

Basic Course in Occupational Medicine 

Behavioral Risk Factor Surveillance System 1984-1995 Survey Data 
Behavioral Risk Factor Surveillance System 1996 Survey Data 
Budget of the United States Government 

Catalog of Federal Domestic Assistance / Federal Assistance Award Data System 

CDP File 

Census Mapper 

CensusCD + Maps 

Coastal Change Analysis Program 

County Business Patterns 

Crime in the United States [Uniform Crime Reports! 

Defense Science and Technology Strategy and Plans 
Energy InfoDisc 
The Ethics CD-ROM 

Gamma Ray Observatory: NASA PPMI: "Lessons Learned" 

Green Chemistry Expert System 
HCFA's Laws Regulations Manuals 

ICD-9-CM: International Classification of Diseases, Ninth Revision. Clinical Modification 
LandView III 

Library of Environmental Images 
MMWR 

National Trade Data Bank (NTDB) [November 1999+1 

National Trade Data Bank (NTDB) fpre-November 19991 

Navy Advancement Center TRAMANs and NRTCs 

1994 National Home and Hospice Care Survey 

1996 Natality Data Set 

1999 Federal Tax Products 

1998 Federal Tax Products 

1998 Massachusetts Tax Forms and Publications 

Occupational Outlook Handbook 

Q*NET: The Occupational Information Network 
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University of Massachusetts 



OTA Legacy 

Partnership for Family Involvement in Education 
Photovoitaics 1996-1997 

Schools and Staffing Survey (SASS) and Teacher Followup Survey (TFS) 

State Personal Income 1929-1997 
TIGER/Census Tract Street Index Version 3 
The United States Army in World War I 
U.S. Exports of Merchandise 
U.S. Imports of Merchandise 
USA Counties 
USA Trade 

USDA Nutrient Database for Standard Reference, Release 12 

Veterans Benefits Administration Automated Reference Materials System (ARMS) 

Volcanoes of Indonesia 



These user guides provide: 

• a brief description of the CD-ROM 

• information about system requirements 

• installation instructions 

• search tips 

• references to equivalents in other formats or to related information 
For other sources of information about using government CD-ROMs, check: 

• the flyer in the jewel case of a particular CD-ROM 

• the readme file on the CD-ROM (often found on the root directory) 

• CD-ROM Technical Documentation Project 

• GODORT Handout Exchange 




UMASS 



© 2000, University of Massachusetts Amherst Libraries . Updated: February 28, 2001 
This is an official publication of the University of Massachusetts Amherst, 

For comments or suggestions concerning this page, please contact William Thompson 
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University of Michigan 




Numeric Data Services 
University Library 

The University Library’s new Numeric Data Service 
helps meet your data needs 

The Numeric Data Service helps you to: 

=> access ICPSR data 
=> identify & locate a data set 
=> interpret documentation 
=> transfer files 

=> import data into statistical packages 
=> support class instruction 

=> access data from the Roper Center, DRI or other 
data providers 

You can use the Numeric Data Service three ways: 

In person at 203 J Hatcher Library North, 
Monday - Friday, 1 :00 - 3:00 p.m. 

Via email at numeric.data@umich.edu 

On the web at 

<www.lib.umich.edu/libhome/data> 

Numeric Data Service means more time researching! 

Start using it today and watch for 
further developments ! 





University of Michigan 



Introducing a New University Library Service 

The University Library is launching a new Numeric Data Service to aid you in teaching 
and research. As an initial step, we have initiated an agreement with the Harvard/MIT Data 
Center to improve our campus access to data files from the Inter-university Consortium for 
Political and Social Research (ICPSR). Through the Harvard/MIT Data Center, U-M faculty, 
students, and staff can search, order and retrieve ICPSR data at any time. For selected 
datasets, the Harvard system can perform cross tabulations or subset data. The URL for 
access to ICPSR data is <www.lib.umich.edu/libhome/data/icpsraccess.html> 

The Library’s Numeric Data Service makes it easier for the U-M community to: 

=> identify & locate a dataset 
=> interpret documentation 
=> transfer files 

=> import data into statistical packages 

=> acquire & store datasets for class instruction 

=> access data from the Roper Center, DRI or other data providers 

Additional services will be added as we grow. Numeric Data Service is available 

=> in person, Monday — Friday, 1 :00 — 3:00 p.m., 203 J Hatcher Library North 

=> via email at numeric.data@umich.edu 

=> on the web at <www.lib.umich.edu/libhome/data> 

The University Library is also beginning development of local capability to manage 
and make data sets accessible via the web. Work is underway, using software acquired from 
UC-Berkeley, to develop local services for data extraction, statistical applications, and data 
subsetting. If you have datasets that you would like to suggest for campus deployment, 
please let us know. We are also actively engaged in efforts to develop campus partnerships 
to increase the availability, improve access, and ensure sustained management of data sets 
and welcome your interest in collaboration as well. 

Please pass this information along to colleagues who would benefit from this new 
service. For more information, contact JoAnn Dionne at numeric.data@umich.edu or 
j dionne@umich.edu . 
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University of Virginia 



http://fisher.lib.virginia.edu/ reference/ guides.html 




1 CATALOGS* > 
1' COLLECTIONS 


INTERACTIVE ^ 
DATA 


REFERENCE ^ 
RESOURCES | 


CLASS ^ 

RESOURCES 1 


USER ^ 

PROJECTS 


STAFF & ^ 

SERVICES 


j GEOSTAT: GEOSPATIAL & STATISTICAL DATA CENTER 


UNIVERSITY OF VIRGINIA LIBRARY | 



Reference Resources 



ERIC 



User Guides 



A ® 



indicates a resource developed outside the University of Virginia Library 

Address Matching in ArcView 

A guide to using ArcView to match addresses. Offers tips on how to format address 
information, appropriate types of street data, and other information about geocoding 
addresses. 



CensusCD+Maps 

CensusCD+Maps is a product which combines demographic data and software to 
provide easy analysis of all census geographic breakdowns for the United States. 
GeoLytics, Inc. has created this product by using 1990 Census Data and adding 
software which lets y ou view and output reports and maps quickly and easily. 

CensusCD+Maps ^ 

Guide developed at Electronic Data Services at Columbia University; largely 
reproduces on-line help on Census CD disc. 

Community 2020 Tutorial 

Community 2020 is a geographic and statistical data resource produced by the 
Department of Housing and Urban Development (HUD) to help community 
development personnel identify, catalogue and keep track of grants and programs 
administered by HUD. It conta ins a large a large amount of information regarding 
these programs. It also has a large amount of 1990 statistical information that is 
attached to geographic files for a large number of areas: block groups, census tracts, 
cites, MSAs etc. Community 2020 is available on every PC in the Geostat lab. 

CPS Utilities 

The Bureau of the Census states that, "The Current Population Survey is a monthly 
survey of about 50,000 households. ..[which] has been conducted for more than 50 
years. The CPS is the primary source of information on the labor force characteristics 
of the U.S. population. The sample is scientifically selected to represent the 
noninstitutional population. ..[It] provides estimates for the nation as a whole and 
serves as part of model-based estimates for individual states and geographic areas." 
(See the Bure au of the Census, CPS) This document will outline the particulars of 
running CPS at EDS as well as giving a brief overview. It is not a complete guide to 
using the CPS Utilities Database. 

International Financial Statistics 

The International Financial Statistics is the principle publication of the International 
Monetary Fund. The IFS contains data from 1948 until 1997. The IMF reports that the 
CD-ROM "provides users with time series data covering approximately 26,000 
economi c concepts which show major economic aggregates used in the analysis of 
economic developments and includes data on the following topics: balance of 
payments, banking and financial systems, employment, exchange rates, Fund 
position, government finance, int erest rates, international liquidity and banking, 
national accounts, population, prices, production and trade." (see IFS Coding 
Documentation) ...... . ... 
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University of Virginia 



National Longitudinal Surveys vy ; 

This User's Guide illustrates how-to begin using the Windows version of the data, base.. ; 
access software NLS Investigator. This retrieval system accompanies one or more of 
the National Longitudinal Survey's (NLS) databases on CD-ROM. With it you can t 

extract data and prepare that data for statistical analysis. You can create, modify, and 
save your own personalized, extracted files for further research. This package is easy ! 
to use. If you already have some knowledge of the NLS and a search strategy in 
mind, yo u will be able to complete that search and extract a file within minutes of j 

using the guide. Just follow this guide and substitute your own strategy for the 5 

examples given. 

Sources for Poiiticai Survey Data 



Public Use Microdata Samples ( MicroAnalvst Version) s 

The PUMS "(Public Use Microdata Samples) data are 1% samples (5% samples are 
available for 1980 and 1990) of households enumerated by the Decennial Census 
between 1850 and 1990. In contrast to what is presented in the ?official? Census | 
reports, these data provide data on individual households and the individuals residing j 
in these households. | 

! 

The PUMS data include information on a broad range of population characteristics, j 
including fertility, marriage, life-course transitions, immigration, internal migration, j 
labor-force participation, occupational structure, education, ethnicity, and househo Id 1 
composition. j 



SAS Support at the University of Virginia | 

SAS is a powerful program used by many banks and corporations worldwide. It is very 
powerful, and is particularly useful for dealing with large data sets. . i 

Getting Started with SPSS for Windows I 

Tutorial for SPSS for Windows 

SPSS Support at the University of Virginia ® 

SPSS is a widely used, user-friendly program that should accomplish most (if not all) 
of your statistical computing needs. 

Trans-Atlantic Slave Trade CD-ROM j 

This CD-ROM contains the records of over 27,000 slave trade voyages between the j 
years 1527 to 1866. This user guide provides a quick overview of the query software 1 
and provides a detailed search. % j 

t 

v 

Union Army Study ! 

The Union Army Data Set consists of 35,747 white males from 303 randomly selected j 
Union Army companies during the Civil War, for whom military, socio-economic, and j 
medical information from the several sources throughout their lifetimes has been j 

collected. This data set was compiled by the Center for Population Economics. 'j 

3D Topoquads ! 

3D Topoquads allows users to search for and view digital topographic maps of Virginia 
by feature and place name, print portions of the map, or save it to the file. Despite 
the name 3D Topoquads does NOT work in 3D in the Geostat Lab. Only Virginia maps | 
are included. 




Electronic Centers | Library Home 

LAST MODIFIED: Thursday, June 21, 2001 
THIS SITE MAINTAINED BY: qeostat@virginia.edu 
© Copyright The Rector and Visitors of the University of Virginia 
University of Virginia / Charlottesville, Virginia / 22903 
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Yale University 



http://statlab.stat.yale.edu/SSDA/helpfiles/nt.html 



Social Sciettc& Data/ Archive 

Help Using SSDA Data Files on the STATLAB Network 



Using ASCII data files with SAS 
Using ASCII data files with SPSS 
Using SAS Transport files 
Using SAS System files 
Using SPSS Export files 
Using SPSS System files 

Using OSIRIS files: converting with Stat/Transfer 
Using Dbank with Balance of Payments data 
Using Dbank with Direction of Trade data 



Introduction 

The instructions and examples in the links above explain how to use data files from a computer directly connected to 
the Statlab NT Server. 

Using ASCII Data Files (.dat or .raw files) 

Column-delimited ASCII files (usually with the file extension .dat or .raw) can be read by the statistical packages on 
the Statlab NT Server. The sections listed above provide information for reading ASCII files in SAS and SPSS, both 
for Windows. 

Using System Data files 

The system data files available are ready for analysis using STATLAB software, with little additional data 
management. The links above provide instructions for accessing system data files in SAS and SPSS. If the analysis is 
designed to use other software, these data files are readily convertible to other data formats using the file conversion 
software DBMSCOPY available also at the STATLAB. 



[ Statlab Home Page | SSDA Search | SSDA Home ] 



Yale University Social Science Statistical Laboratory 
comments: stafhe/r/a.ra/e. edu 
Revised: September 8, 1999 
Web Site: http://statlab.stat.yale.edu 
Copyright© Yale University, 1996 
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SPEC Kit 263 

Numeric Data Products and Services 



Data Service Descriptions 



Emory University 



http:/ / einstein.library.emory.edu/ About.shtml 





Electronic Data Center j 



Introduction! 



Electronic Data Center was established in 1996 to support quantitative research in the Social Sciences by 
members of the Emory Community. Located in the Technology Centers (Room 217) in the Robert W. 
Woodruff Library , the Data Center provides researchers at Emory with access to a broad collection of 
numeric data, workstations with statistical and mapping software and a knowledgeable staff with expertise 
in data development and quantitative research. The Electronic Data Center at Emory University is one of a 
handful of full-service Data Centers among universities in the United States. 

jOur Services; 

Emory's Electronic Data Center provides researchers and students with access to over 5000 data sets 
covering diverse subjects such as Criminal Justice, Urban Renewal, Health Policy, International Relations, 
Political Behavior and Economic Development. Interested Emory Researchers may access our online 
collection anytime or request that we acquire data from ICPSR , the US Government , and many other data 
providers . In some instances users can even analyze the data online using applications developed in 
partnership with Emory's Information Technology Division . We also provide faculty and students with a 
cluster of high-end workstations equipped with the latest software applications to facilitate their research. 

The staff of the Electronic Data Center are active researchers with a strong interest in supporting the use 
of data and statistics analysis in research. We provide consulting and instruction to students and faculty at 
Emory regarding issues such as survey design, business forecasting and statistical programming. Our 
services have been utilized in support of research papers, theses, faculty publications, government reports 
and business analyses. We are active in professional organizations and on occasion we have provided 
support for local groups, emeritus faculty and alumni in need of our expertise. 



Home | The Data Center l Services | Contacts & Hours | Emory General Libraries 



©2001 Emory University 

Question or Comments? Contact: The Electronic Data Center 
About this web site 




Harvard University 



http://data.fas.harvard.edu 
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New.*," 



Feature 



Overview 



[The Harvard-MIT Data Center is the principal distributor of quantitative social science data at Harvarc 
[ University and the Massachusetts Institute of Technology . It serves as the universities' official 
[representative to the Inter-University Consortium for Political and Social Research (ICPSR); as a 
[representative (and/or repository) to Roper Center for Public Opinion Research (searchable through 
[Lexis-Nexis), and the National Center for Health Statistics (NCHS); provides access to Murray Center 
[ student datasets . Social Sciences Program Data ; HMDC's Guide to Census Microdata and is a central 
[contact point for many other archives and data suppliers. We maintain a large library of electronic date 
|from all these sources, a growing collection of unique data sets, and an extensive codebook library. 

: ew other university data centers approach the quantity of social science data that passes through here 
|on a regular basis. 

[he HMDC web site has been awarded the title "Best Political Science Research Web Site, 1999" by 
the American Political Science Association (CMS). Many features of this site are unique, such as over 
[five hundred online subsettable data sets , sophisticated browsing and searching options, automatic 
[ordering of new data sets from other repositories, and interactive data exploration. We encourage our 
[affiliates to take advantage of them all. 

ou may be interested in the history of the Data Center and our latest developments and innovations. . 
|Our largest project currently is the creation of a data center ( VDC) — a free, open, portable, 
self-installing data server, using the protocols for communication and searching from our ongoing 
Digital Libraries Project, and our recently completed Record of American Democracy project (the 
largest collection of combined census and electoral data set ever created). The VDC project will 
|seamlessly integrate independent virtual data center sites around the world and make data sharing and 
listribution easy and transparent. 



USJE.EL 
.iJNKS 



Facts 



The Data Center is located in the Government Department in Littauer Center North Yard. We also 
lave offices for our staff at CBRSS and WCF1A . We are always open (physically or virtually), but the 
iData Center staff are available to the public by appointment during the Summer, and during the fall 
and spring terms during office hours (except for official national, state, and Harvard holidays and 
recesses). Fellows may offer consultation at any time. We also operate several computer labs , with 
facilities for extracting, translating and statistically analyzing data. We look forward to your comments 
or suggestions . 



We wish to thank our supporters . 
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University of Manitoba 



http://www.umanitoba.ca/libraries/units/datalib/ 




ERIC 



65 



58 






Universtiy of Michigan 



http://www.lib.umich.edu/data/ 







Numeric Data Services provides: 

• Help identifying and locating a data set, interpreting documentation, help 
transferring files, help importing data into statistical packages. Web sites on 
Identifying Numeric Data and Using Numeric Data are available. 

• Access to ICPSR data: For more information on using ICPSR data see Access 
to ICPSR data for the University of Michigan. 

• Access to other data sources such as: DRI/Webstract, the Roper Center and 



• Support for class instruction: Web-based exploration and extraction for selected 
data sets; acquisition and storage of data sets. 




NCHS. 



Staff JoAnn L. Dionne, Social Science Data Librarian 



Graciela Mentz, Data Consultant 

Abigail Leah Plumb, Data Consultant 

Hong Zieske, Numeric Data Programmer Analyst 



Hours Monday-F riday, 1-3 pm 



Numeric Data Services will be closed 
September 4th for Labor Day. 



Location 203J Hatcher Library North 



Contact numeric.data@umich.edu 




m 



University of Michigan 

Questions or comments? Ask Numeric Data 

Last Update: 09: 18 PM EST on Wednesday, November 8, 2000 
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Universtiy of Pennsylvania 



http:/ /data.library.upenn.edu/ 



■ ' ■ I ^UNIVERSITY OF PENNSYLVANIA LIBRARY 

Index 

Quantitative and Geospatial Data Holdings 

; Franklin Catalog 

Databases ; , 



Services 

Data Holdings 
Philadelphia Data 
I Philadelphia Maps 
Data Archives 



I • ICPSR 

I • Roper 

• Sociometrics ; 

I • Census 

• IPUMS 

• FedStats 

• STAT-USA 

• Historical 

I Census 

! • Gov Info j 

Sharing ! 

! U.S, Census Guide 

I Web Resources j 

1 What's G1S7 * 

j Creating Census \ 

1 Data Map: ArcView 

1 Tutorial 

" \ 

1 

Lippincott 
CD-ROMs 
Annenberg 
CD-ROMs 
i Health Statistics 

j Campus Data 

| Centers 

! 

I 

j Library Home 
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To support the research needs of Penn faculty, students and staff, the University of 
Pennsylvania Libraries acquire and subscribe to numerical and geospatial data resources 
in print and electronic formats. The library is a federal depository library, and receives 
large amounts of numerical data from agencies. Many of these resources are available 
on CD-ROM as well as on the web. Some of the agencies are: 

• Census Bureau 

• National Center for Education Statistics 

• National Center for Health Statistics 

• Bureau of Labor Statistics 

• FBI— Uniform Crime Reports 

• Environmental Protection Agencv:Envirofacts . 

The library is also a repository of United States Geographic Survey maps that are 
housed at the High Density Storage Facility, Suite #10, 3001 Market Street. This 
collection is not cataloged in Franklin. Please consult Collection Development Policy: 
Maps regarding this collection. 

The library's numerical and geospatial collections have a special emphasis on 
Philadelphia data and maps . Recently, the Penn Library has digitized historical 
Philadelphia census tract maps using ArcView. These maps and the electronic census 
data build a foundation for researchers to study demographic changes that have occurred 
in smaller areas of Philadelphia over the past sixty years. 

The library is a subscriber to some major numeric data archives, such as ICPSR, Roper 
Center for Public Opinion Research and Sociometric Social Science Electronic Data 
Library. The Numeric Data Archives web page provides information on obtaining 
datasets from these archives. 



The Library also holds uncataloged or incompletely cataloged U.S. census resources in 
print, microform, and CD-ROM. Much 1990 Census data is available online. Access to 
these materials will be discussed in this U.S. Census guide. 

The Library provides access to several major statistical databases through the Libraiy 
web site including 



BankScope 
Country Finance 
Ed gar Scan 
EuroMonitor 
FISonline 

I SI Emerging Markets 

MultexnetUn Statistics Division Common Database 

Polls & Surveys vie LEX1S-NEX1S Academic Universe. Clinck on Reference. 
Polling the Nations 

Social Science Electronic Data Library (SSEDL) 

Statistical Universe 
SourceOECD “ ’ * 
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• Standard & Poor's Industry Surveys 

• STAT-USA ~ ' 

• TableBase 

• Un Stats Common Database 



The Lippincott Library houses a collection of demographic and marketing statistics on 
CD-ROMs . In addition, Lippincott library also subscribes to international financial 
resources, such as DataStream. 



Franklin , the online catalog, is used to locate the library's holdings of print and 
CD-ROM publications. These resources are housed in: 



• Van Pelt Library-’ 

• Lippincott Library 

• Fisher Fine Arts Library 

• Biomedical Library 

• Annenberg Library 



To find these resources and their locations, please use the Franklin keyword search for 
terms such as: 



franklin 

express! 



find 



ortywtare in 

Title R ft' 



statistics AND 
Philadelphia 
census AND 1970 
estimates AND building 
health AND statistics AND 
CD 



DOQ OR (digital orthophoto 
quadrangles) 

DOL OR (digital line graphics) 
DRG OR (digital aster graphics) 
USGS AND CD 



Or search by author for terms such as: 



i 



i 



Geological Survey 
(U.S.) 

United States. 
Census 

GeoLytics, Inc. 
CACI, Inc. 



National Center for Health Statistics 
World Bank 

Organisation for Economic Co-operation 
and Development 

United Nations. Dept, of Economic and 
Social Affairs 



home hours help IS libraries forms comments a-to-z patron search 

Copyright© 1998-2001 University of Pennsylvania Library. All Rights Reserved. 

Last update: Monday , 30-Jul-2001 19:36:59 EDT 
Send mail concerning this page to: geodata(d)pohox. upenn. edu 
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Data Services Policy 
Reference and Instructional Services 
University Libraries 
University of Tennessee 



Mission 

Data Services facilitates access to machine-readable numeric data files for secondary 
data analysis. Data Services helps researchers identify appropriate numeric data files, 
locates files in-house or through outside sources, and acquires numeric data and 
documentation files for the Libraries’ collections. 

Data File Identification 

Data Services offers consulting on the content and format of the Libraries’ machine- 
readable data holdings, as well as data files not held by the Libraries. Reference 
assistance in identifying appropriate research studies and data files is extended to 
University of Tennessee faculty, staff, and students, and non-University persons at no 
charge. Service to researchers affiliated with the University of Tennessee, Knoxville 
has priority over service to all other patrons. 

Data File Availability and Access Information 

Data Services attempts to locate data and documentation files identified by the 
researcher. Information on how to access or order the files is provided to the user, 
regardless of his or her affiliation. Service to researchers affiliated with The University 
of Tennessee, Knoxville has priority over service to all other patrons. 

Data File Acquisition 

Data Services acquires requested data and documentation files available through 
consortia or organization memberships (e.g., ICPSR) and no-fee data providers (e.g., 
U.S. government statistical agencies) for University of Tennessee, Knoxville faculty, 
staff, and students at no charge. Data products from commercial or fee-based 
organizations may be purchased by the Libraries if funding is available and the cost is 
reasonable and justified. Data Services cannot guarantee the purchase of any data 
file(s) requested by the patron. 

Data Services does not download, order, purchase, or otherwise acquire data files for 
patrons not affiliated with the University of Tennessee, Knoxville. 
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Preparing Data Files for Use 

Data verification and manipulation of data files (as described below) is limited to 
University of Tennessee, Knoxville patrons. If time and resources permit, Data Services 
may be able to answer questions from non-University patrons about data files they 
acquired for themselves, but no action will be taken on the files. 

Data Services generally performs basic data verification tests on data and 
documentation files obtained from ICPSR and other sources. This includes checking 
the readability of a file and that the correct number of records and variables are present. 

Data Services can, on a limited basis, assist University patrons with the extraction of a 
subset of data for analysis. Patrons in need of complex subsetting or subsetting of very 
large data files are referred to the Statistical Consulting Center for assistance (200 
Stokely Management Center; 974-9900; dii.utk.edu/scc/). 

Statistical Analysis and Consulting 

Data Services does not provide computing or statistical consulting on research projects. 
For assistance with manipulating, summarizing, analyzing, graphing, mapping, or 
interpreting data, patrons are referred to the Statistical Consulting Center (200 Stokely 
Management Center; 974-9900; dii.utk.edu/scc/). 

Contacting Data Services 

The primary contact for answering general service questions or identifying, locating, 
and/or acquiring data files is: 

Eleanor Read 

Social Science Data Services Librarian 
ICPSR Official Representative 
120A Hodges Library 
(865) 974-0011 
eread@utk.edu 



This policy is subject to change as data products and/or computing and staff resources 
change over time. 



4/01 EJR 
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http: / / www.chass.utoronto.ca/ datalib/ 

Data Library Service 
University of Toronto 



Table of contents: 

• What’s new in the DLS 

• Finding Canadian, U.S. and international statistics 

• Finding Canadian research data 

• Collections 

° Major collections 
° Access conditions/restrictions 
° Search UT/DLS inventory of holdings 
° Census of Canada, 1996 

• Internet data resources 

• Sendees 

• Commonly asked questions 

• Contact us 



Training opportunities 

° Training opportunities 
° Sources of funding 
Miscellaneous 

o OCUL/CREPUO federation 
° CER1S data sets 

UToronto/Ryerson/York Research Data Centre 
CF1ASS home page 
C FI ASS databases 
DMGIS home pages: 

° Government Documents 
o Map Library 



Introduction 

The Data Library Service (DLS) at the University of Toronto was established by the University of Toronto Library 
and Computing Services (UTCS) in 1988. It is now wholly maintained by the Library, and is a part of the Data, Map, 
and Government Information Services (DMGIS). It supports empirical research and teaching activities of University 
of Toronto faculty, students and staff by collecting, managing, and making available research data in 
computer-readable form. 

By archiving data files collected by University researchers, and acting as a registry for non-depositing data owners, 
the DLS also functions as a central data archive, as well as a source of expertise in the long-term preservation of 
computer- readable information. 

The DLS does not provide access to bibliographic and other 'look-up' databases. Access to these resources is provided 
through the reference departments and branch libraries of the University of Toronto Library system. 



Objectives 

The objectives of the University of Toronto Data Library Service are to: 

• acquire, manage and preserve machine-readable data files needed to support empirical or statistical research 
and teaching activities of the University of Toronto, 

• provide access to machine-readable data files owned by the University of Toronto, 

• provide support for users of these machine-readable data files. 

Return to table of contents 
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Collections 

The collections of the DLS consist of numeric, spatial and textual research data files, primarily but not exclusively in 
the social sciences. These files contain quantitative research data, including microdata, aggregate data and time-series 
databases. 

To search for specific titles or principal investigators, etc., please search UT/DLS inventory of holdings 
Currently, major categories of holdings include: 

• Canadian census data (aggregate and microdata) from 1961, 1966, 1971, 1976, 198L 1986 . 199L and 1996 , 
from the Census of population and the Census of agriculture, as well as selected U.S. census of population and 
housing data, 

• microdata from selected surveys conducted by Statistics Canada and other government departments, 

• microdata from major social surveys conducted in Canada, the United States, Great Britain, etc., 

• public opinion polls from commercial pollsters, such as Canadian Gallup, CBC/Globe and Mail, Insight 
Canada Research, etc., 

• selected major socio-economic and financial databases from: 

° Datastream 

o DR1 McGraw-Hill databases such as COMPUS 1A 1 and Basic Economics 
o Center for Research in Security Prices (CRSP) 
o Financial Posfs Corporate database and Mutual funds database 
° CFMRC (formerly TSE/Westem) 
o International Monetary Fund’s 1FS database 
° OECD national accounts database 

• a small collection of textual tiles , and the University’s membership in ARTFL (Project for American and 
French Research on the Treasury of the French Language, University of Chicago ), 

• digitized map files, such as World Data Bank II, a map of the world, and digitized political/census boundaries 
for Canada. 

A few of our most popular time-series and aggregate databases have been provided with a WWW interface to 
searching and retrieving data, courtesy of CHASS, at: <http://datacentre.cha$s.utoronto.ca/> . 

In addition, the DLS is constantly acquiring new data from major disseminators such as: 

• Statistics Canada's Data Liberation Initiative [DHL 

• Inter-University' Consortium for Political and Social Research (ICPSR) , as a member of the OCUL/CREPUQ 
federation , and the 

• Roper Center 

as well as from other data archives and data libraries, and data disseminators around the world. 

Return to table of contents 
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Services 

The DLS offers a variety of services, including: 

• Codebooks: the DLS has copies of documentation describing the content and format of all data files in the 
DLS collection. Where possible, machine-readable codebooks are made available by searching the UT/DLS 
inventory of holdings . Many codebooks, however, are available only in print, but circulating copies are 
available in the UT/DLS, as are circulating printed copies of some of our computer-readable codebooks. 

• Acquiring new data fdes: identification and acquisition of new data files and/or related documentation 
needed for research and/or teaching. The DLS maintains contacts with other data archives and libraries, and 
can assist with searching for and getting copies of data resources available elsewhere, either via the Internet or 
through purchase (the budget allowing). The DLS maintains memberships in the following: 

° Statistics Canada's Data Liberation Initiative [DLJJ 
o Inter-University Consortium for Political and Social Research (1CPSR) 

° Roper Center for Public Opinion Research 

• Creating custom subsets: technical assistance in the identification and retrieval of data from large data files, 
including creating custom subsets of just the cases and variables needed for your research, in the software 
dependant format needed for your statistical package, 

• Transferring data between hardware/software platforms; 

• Statistical packages: help with the use of statistical software in the definition of data files; 

• Custom instructional data sets: construction of custom data sets for instructional or research use; 

• Archiving: cleaning, documentation, archiving, and dissemination of original data files deposited by 
researchers; 

• Citing data files: help with the composition of bibliographic citations for computer files. 

Return to table of contents 



Access conditions and restrictions 

The following restrictions are specified in the contracts that we have signed with individual data suppliers. 

Who may use the data: 

• Unless otherwise specified (in the UT/DLS inventory of holdings ), data in the DLS collection are for use by 
University of Toronto faculty, students and staff only . This restriction applies to on-line databases (CANSIM, 
DRI basic economics, IMF, OECD, WB etc.), all CD-ROMS available in the DLS, databases on diskette, as 
well as other data files in the DLS collection. 

• Some statistical products disseminated through the Data Liberation Initiative are also available to Ryerson 
Polytechnic University and York University IP addresses as well. Where materials are not accessible through a 
WWW server, faculty, students, and staff of these local universities should contact their local data service 
personnel (see Internet data resources:lndividual data arc-hives:Canada ). 

• Visiting faculty, visiting graduate students, affiliates, etc. , must bring a letter signed by their respective 

Department Chair or equivalent. This letter should confirm the visitor's status vis-a-vis the University of 
Toronto, request that the visitor be treated as a faculty, student or staff member (as appropriate), and the length 
of time that such status should be accorded. A copy of the letter must be provided to the Data Service Librarian 
for our files. ... •• 
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• Alumni, holders of corporate library cards, and external readers have no access to restricted data files in the 
DLS collection. Only data specifically designated 'public use' and data files which the DLS is contractually 
allowed to disseminate, can be made available to these users. 

Acceptable uses: 

Acceptable uses include academic research and/or teaching (and by the University administration for administrative 
planning) only. 

• University of Toronto faculty, students, and staff who access these data must not make copies of these data 
available, in whole or in part to colleagues who are not part of the University of Toronto academic community. 
Similarly, computing accounts at the University of Toronto should not be made available to non-University of 
Toronto persons for the purpose of accessing restricted data. 

• These data may not be used for commercial consultancy work, commercially funded research, etc. Such users 
should contact the Distributor of the file and make their own arrangements for acquisition of the data. 

• Use of microdata in any way to attempt to identify individual respondents is prohibited. 

When in doubt always consult DLS staff. This is one instance where it is not easier to ask for forgiveness than 
permission. Unauthorized use could result in the University of Toronto being permanently prohibited from acquiring 
data from major sources. 

The DLS is contractually allowed to disseminate a number of the data files in its collection. Copies of these data files 
can be made available to other users, on request, subject to the terms of individual deposit agreements. Contact DLS 

staff. 



Return to table of contents 



Miscellaneous: 

Database of data library/data archive related literature./comp. By Laine G.M. Ruus. 1978 - . 
http://prod.librarv.utoronto.ca/datalib/reference.html 

Internet-based resources for internet training./ by Rea Devakos & Laine G.M. Ruus. Nov. 1997 
http://www chass.utoronto.ca/datalib/misc/trainmat.html 



Social science research in Canada and federal government information policy: the case of Statistics Canada - 
executive summary./ Nilsen, Kirsti. (Ph.D. diss., University of Toronto, 1997). 
http://www.chass.utoronto.ca/datalib/misc/nilsen.html 

• Note: To obtain a copy of the full thesis text, contact K. Nilsen at nilsen@fis.iitoronto.ca . 

Return to table of contents 
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UNIVERSITY 
Of VIRGINIA 

Library 



Catalogs & 
Collections 

...ICPSR archive; 
map collections; 
geo-data 
clearinghouse... 

Interactive 

Data 



...retrieve and 
create custom data 
sets and maps... 

Reference 

Resources 

...code finders; 
map symbos; unit 
converters; other 
useful sites... 

Class 

Resources 

...instructional 
materials for UVa. 
classes... 

User Projects 

...Slave Trade; 
Salem Witchcraft; 
Victorian London... 

Staff & 
Services 

...Who we are and 
what we do... 

Site Map 
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http: / / fisher.lib.virginia.edu / 
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GEOSPATIAL & STATISTICAL DATA CENTER 



What We Do 

Geostat supports a wide range of academic and scholarly activities 
through access to extensive collections of numeric and geospatial data : 
files; computing facilities and software for data manipulation, research, j 
jand instruction; and a suite of Internet-accessible data extraction 
Tools. i 

NEW: 2000 National Election Study & 2000 
General Social Survey 

I ■ 1 

jThe final editions of the 2000 National Election Study Pre- and 
| Post-Election Survey data and 2000 General Social Survey data 
^have been released for researchers through ICPSR . [UVA ACCESS 
ONLY] 

NEW: Digital Sanborn Maps of Virginia 

The Virginia Digital Sanborn Maps collection of Bell and 
Howell/UMI provides University of Virginia users with access to most 
maps of Virginia communities produced by the Sanborn Map Company, 
from the 1880s - 1950. Digital images were produced from black and 
white microfilm of atlases deposited at the Library of Congress, or 
transferred there by the Bureau of the Census. [UVA ACCESS ONLY] 




ABOVE: Map of the Lawn in 1920 produced by the Sanborn Fire 
Insurance Company. 



I UPDATE: County Business Patterns 

i 

jThe County Business Patterns data base provides county, state, and 
Inational level business data for 1977-1998. 

| UPDATE: Regional Economic Information System 

iThe REIS database provides local area economic data for states, 
counties, and metropolitan areas for 1969-1999. 3 

Electronic Centers | Library Home 



LAST MODIFIED: Thursday, August 23, 2001 
THIS SITE MAINTAINED BY: oeostat@virainia,edu 
© Copyright The Rector and Visitors of the University of Virginia 
University of Virginia / Charlottesville, Virginia / 22903 
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Lab Hours 

The Geostat Lab is 
located on the 3rd 
Floor (West) of 
Alderman Library, and 
is open whenever 
Alderman is open . 

Staffing Hours 

Staff are available to 
assist patrons on the 
following schedule: 

Fall Semester: 

Mon - Thur: 

9am - 9pm 
Fri: 9am - 5pm 
Sat: 1pm - 5pm 
Sun: 1pm - 5pm 

Constant Schedu le 

Visit Geostat 

Directions and Maps 



Contact Us 

PH: 434.982.2630 
FAX: 434.924.1431 

qeostat@virqinia.edu 

US Mail: 

Geostat Center 
Alderman Library 
PO Box 400129 
Charlottesville, VA 
22904-4129 

Other Deliveries: 

Geostat Center 
Alderman Library 
160 McCormick Rd. 
Charlottesville, VA 
22904 
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http: / /tdr.tug-libraries.on.ca/overview.htm 

TDR ABOUT TDR 

i HELP J SEARCH J 1 TDR DATA : LICENCES ; ; CONTACT US : 



OVERVIEW OF TRIUNIVERSITY DATA RESOURCES 



The establishment of Tri-University Data Resources is the result of a joint venture between 
the University of Guelph, the University of Waterloo, and Wilfrid Laurier University. 

Members of the individual centres for data resources on each campus as well as 
representatives from the University of Guelph Computing and Communications Services and 
the University of Waterloo information Systems and Technology have joined forces to create 
this centralized on-line data service. 

The creation of this joint service arises from the need to dedicate resources to the 
centralization and distribution of data resources which have increasingly been produced in 
electronic format .Recent agreements on consortial use of data by Statistics Canada (through 
the DLI initiative) and ICPSR have allowed a rapid change in the way data is disseminated to 
educational institutions. The resources required to manage this acquisition and distribution of 
data is more than is possible for many institutions to handle given the current financial climate. 
The concept of regional centres for university data resources is being discussed and initiated 
throughout North America and Europe. 

Users will benefit from the pooling of resources as data from the three institutions will be 
offered to members of the three participating universities. Sharing technology and human 
resources will result in an increase in the ability to process, mount, and maintain new data as 
it arrives. User support will also benefit as expertise is shared between the three institutions. 

The main focus of the centralized service will be a joint web site where all data licensed to be 
shared is mounted and maintained on a single web site: http://tdr. uoquelph.ca 

Individual institutions will continue to offer support and service to their respective communities 
of users. Each institution will also maintain data licensed to its own community due to specific 
localized research needs and licensing restrictions. 

Overviews of each data centre including services offered and hours of operation can be found 
by following the links provided below. 

• University of Guelph Data Resource Centre 

• University of Waterloo Electronic Data Service 

• Wilfrid Laurier University 

Background documents related to the creation of this joint venture can be found by following 
the Publications link listed below. 

PUBLICATIONS 



Home Help Search TDR Data Contact Us Site Map Licences 
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http://statlib.stat.yale.edu/SSDA/ssda.html 







The Social Science Data Archive at Yale 

The Social Science Data Archive (SSDA) is the repository and reference center at Yale for machine-readable data 
sources in the social sciences. The SSDA owns and maintains a major collection of data from academic surveys, 
public opinion surveys, government agencies, international organizations, and related groups. SSDA codebooks and 
reference services are available through the Social Science Library and Information Services ; the Social Science 
Statistical Laboratory provides technical assistance for dataset users. 

The SSDA holdings are restricted to use by the Yale community. 

SSDA Holdings Information 

• SSDA Catalog: SEARCH by keywords or BROWSE by subjects. 

The SSDA Catalog includes descriptions of datasets, detailed holdings information, and links to documents 
about how to access the data files. 

• Yale Roper Collection ; Note that many of these datasets are not listed in the SSDA Catalog. 

• Orbis , the Yale University Library online catalog. Orbis includes records for datasets on CD-ROM and for 
codebooks which may not be included in the SSDA catalog. 

Help using SSDA data 

Extensive help using the data and statistical software on the Statlab network is available. 

Data on the Internet 

The SSDA maintains a list of Internet sites at which you can directly access numeric social science data. 

See also the list from UC San Diego . 

Data at Other Archives 

Inter-University Consortium for Political and Social Research , University of Michigan 1CPSR, the 
world's largest repository of computer-readable social science data, acquires, processes, and distributes 
data collections on a broad range of topics. Data holdings of ICPSR may be searched on line. Yale 
faculty, students and staff on the Yale network may download data directly from ICPSR. Questions 
about ICPSR data may be directed to the Data Librarian (iulie.linden@vale.edu ). 

Search the Roper Center Survey Archive. The Roper Center now offers a WWW based search 
facility for locating questions asked in polls in the Roper Survey Archive. 
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Searchable Catalogs of Data listed by the Social Sciences Data Collection at the Univeristy of California 
at San Diego 

CESSDA Integrated Catalog : Council of European Social Science Data Archives. Search the data 
archives of: France, Denmark, the UK, Norway, Sweden, Israel, the Netherlands, Hungary, Germany, 
Australia, and ICPSR. See also the World Map of Social Science Data Archives , Council of European 
Social Science Data Archives 

Glossary of social science data terminology 

Terms often used in a data library environment are defined in this glossary from UC San Diego. 



Yale University Social Science Statistical Laboratory 

comments: statheli/d'statfab.stat. vale. edu 

Revised: 22 August 2001 

Web Site: http://statlab.stat.yale.edu 

Copyright © Yale University, 1997 



[ Statlab Home Page | Social Science Library ] 
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Job Descriptions 



University of California, San Diego 



Job Description. 75% Data Services Librarian 
RESPONSIBILITIES: 

This position works on the maintenance, development and integration of Social Science 
Data collections and services within the Social Science and Humanities Library and the 
wider data user community to provide excellent access to these important resources. 

Working closely with the other units in the Data Government and Geographic 
Information Services group, Jim will work to better integrate services through various 
projects that include but are not limited to the ideas described below. While service 
decisions are still made within Data Services, production programming and server 
maintenance activities have been transferred to the Software Engineering Department. 
In order to do accomplish Data Services priorities, a significant amount of time is 
currently focused towards collaborating with the Software Engineering Department. 

Jim is in a unique position to provide national leadership on issues affecting 
dissemination and preservation of public data. Time to pursue these issues is an 
excellent outcome of the reorganization of this position. His postings to national 
listservs are sparking important debates and are being widely quoted and published. 

PROJECTS AND ACTIVITIES: 

The following projects and activities are at various conceptual stages, but all represent 
ways in which the goals of Data Services are carried out through this position. 

1 . Increase communications and education among docs/data/gis/maps staff to insure 
clear understandings of related holdings and their most appropriate uses in order to 
insure appropriate referrals. This will have added benefits for reference desk staff and 
the UCSD community as different outreach projects are developed within the group. 

2. Keep apprised of and involved in changes in the mode of data delivery from ICPSR, 
i.e. web based delivery to provide improved services to the users of these data sets. 

3. Increase and enhance our web based and PC-based tools for data and map/gis use. 

4. Create a web-based search and browse system that will help users find the *best* 
resource to answer their questions - whether the resource is a large dataset, a CD- 
ROM, a paper map, a web site, a UCSD librarian, etc. While the project will eventually 
describe all of our resources, deeper content focus will be on local San Diego and 
perhaps regional trans-border data. 

To this end, we have some major opportunities right now with gis/maps/docs/data. 

- We have a considerable amount of already-machine-readable materials to which we 
can enhance access. 

- We have a large and growing user base for these materials. 

UCSD Libraries Data Services Librarian Job Description 2001 Page 1 of 2 
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- There is an international infrastructure supporting activities like ours and working with 
standards-based tools allowing us to integrate our work with the work of others and 
collaborate internationally. 

- We have experienced staff and infrastructure in place and existing collections and 
tools that we can expand and enhance. We are not starting from scratch. 

- The tools and services we develop within the project I've outlined above can be 
expanded or adapted to other subject areas in the library. We could, in essence, be a 
pilot project or incubator for other projects. 

The California Digital Library recently embarked on a project called "California Counts." 
On the surface, it looks quite a bit like our proposed Docs/Data/Maps/GIS project, but 
there are some major differences that make the two projects complementary rather than 
redundant. 

California counts will provide access directly to a "data warehouse" of numeric data 
from (initially) about 6 studies. Users will be able to find, not only studies that exist (e.g., 
the census, the county and city databook), but the variables in those studies. People will 
use California Counts to find numbers in a few files and create tables, charts, maps, and 
downloadable data files directly. In short, California Counts is about DATA. 

The UCSD Docs/Data/Maps/GIS project will encompass information about literally 
thousands of datasets, studies, and files. These will include paper maps and books, 
numeric data files, GIS spatial information files, and more. Formats included in the 
project will be books, maps, computer files, and more. Computer files will be included if 
they are available on CD-ROM, SSDC, ICPSR, or publicly available on the Web. The 
project will not include the numeric data or variable level information. People will use the 
project (which will be available through the Web) to locate any of thousands of 
information products that may help them answer their questions. In short, the project is 
about META-DATA. 

California Counts and the UCSD projects do have one important thing in common. Both 
are using the same markup language and definitions (XML DTD) to store information. 
This will mean that in the future, CDL and UCSD will be able to build collaborative and 
interacting projects. 
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POSITION: Data Services Librarian 
DEPARTMENT : Reference 
PURPOSE AND SCOPE: 

The Data Services Librarian provides assistance with numeric data files, electronic 
government publications, and general reference sources. The Librarian provides 
consulting and instruction in the use of statistical information, including files distributed 
by the federal government. The Librarian also maintains relationships with university, 
state, national and international data organizations such as the U.S. Bureau of the Census’ 
State Data Center and the ICPSR. Includes some evening and weekend hours on a 
rotating basis. 

DUTIES AND RESPONSIBILITIES: 

Provides public access to U.S. Census data by consulting and advising patrons about 
availability of data and by retrieving information from data sets, including files stored on 
mainframe accessible tape cartridges. 

Instructs library users by teaching classes in the use of statistical data and prepares 
instructional guides. 

Provides reference service by participating in staffing the Reference Department’s 
reference desk approximately 6-8 hours per week, and by developing and maintaining 
knowledge of electronic and print information resources and services within the Libraries, 
on campus, and beyond. 

Selects and maintains the Data Services collections, including code books, by consulting 
appropriate sources, submitting order requests, downloading data sets, and reviewing the 
collections for possible transfers or supersessions. 

Participates in discussions regarding the future of the Regional Depository by 
contributing information about series that are delivered electronically, series that have 
been sold to commercial publishers, and other changes in the distribution of information 
from the federal government. 

Monitors developments and best practices elsewhere to ensure the excellence of 
Georgia’s data archive and electronic services by attending training, conferences and 
other professional meetings as appropriate. 
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Advises the Department Head about software, hardware, and other equipment which may 
be needed in either Data Services or the Reference Department. 

Maintains the Data Services Department page in accordance with the guidelines 
established by the Web Advisory Group. Selectively relocates government documents 
pages to the Reference Department. 

Keeps current on changes in the organization, serving on appropriate Libraries’ 
committees, and maintaining flexibility to assume similar duties and responsibilities as 
assigned. 

QUALIFICATIONS: 

REQUIRED: ALA-accredited MLS; Reference coursework in government documents or 
relevant experience; Familiarity with technology that supports research with government 
information and numeric data sets; Ability to prepare documentation which supports the 
use of statistical information, including the use of the Web for making the information 
available; Subject knowledge in the social sciences and quantitative analysis; 
Demonstrated ability to provide instructional presentations and training; Ability to 
manage a broad variety of tasks and shifting priorities; Excellent oral, analytical and 
written communication skills; Systematic attention to detail; and Ability to establish and 
maintain effective working relationships. 

DESIRED: Experience with census materials in print and electronic formats; Working 
knowledge of statistical analysis software packages; Working knowledge of HTML. 
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McMaster University 



JOB IDENTIFICATION 



McMaster University 
Pay Equity questionnaire 

******* Response Booklet ******* 




EMPLOYEE IDENTIFICATION 



Job Title Digital Resources Specialist 

Generic Job Class # (if applicable) 

Department Data Services (Library) 

Location Mills 217 

Telephone 

Reports to (title) Assistant University Librarian 

Department _Readers Services 

Location Mills 204 

Telephone ext. 23883 



Surname, Initial(s) 

I.D. Number 

Signature 

Date 

APPROVALS 

Supervisor’s Signature 

Date 

Department Head Signature 
Date 



(DO NOT COMPLETE SHADED AREA) 
JOB EVALUATION RESULTS 

Job Number __ ___ 

Job Evaluation T otal ; _ _ _ - 
Level 

•• Date ■ ' •' 



□ 

□ 



Vacancy 

Re-evaluation 



□ 

□ 



3 Year Update 
Review 



INSTRUCTIONS: 

PLEASE USE THIS BOOKLET AND THE RESPONSE SHEET TO ANSWER THE PAY EQUITY QUESTIONNAIRE. 
WHEN COMPLETED AND SIGNED, FORWARD BOTH TO THE HUMAN RESOURCES AREA OFFICE. 

O 
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McMaster University 



II - JOB SUMMARY 



What would you say is the primary purpose of the job? Please summarize the most important aspect of the job. 

This position manages the University Library's Data/Text Service; facilitates user access to digital data and text; and 
anticipates new technology in order to integrate that technology into services provided by the Library'. The Digital 
Resources Specialist works collegially with staff from Reference Services, who help to provide the content and interface 
requirements for the Data/Text Service; and with staff from the Systems unit, who provide the technological contexts for the 
D/T Service. 



What are the most important end results that are produced to accomplish this purpose? In the following space, please list 
the end results or specific accountabilities of the job Please do not list individual duties. 

For re-evaluations or revisions to an existing job, please clearly indicate changes using bold type, asterisks, etc. 



SPECIFIC ACCOUNTABILITY 


SKILLS / KNOWLEDGE / ABILITY 


1. 

Manages the University Library’s Data/Text Service by: 

• mounting digital material on the Service’s designate 
server. 

• assuming responsibility for the backup and ongoing 
care of Data/Text Service equipment. 

• ensuring compliance with software and data license 
restrictions by the Service and its users. 

• establishing appropriate security measures for the 
Service’s equipment and software within the 
parameters set by the Systems unit.. 

• reviewing and monitoring the use of the Service and 
generating appropriate statistical reports 


• comprehensive experience with UNIX, UNIX shell 
scripts and UNIX utilities 

• extensive knowledge of statistical and data management 
software (e g. SAS, SPSS, Open Text). 

• knowledge of the operations of information centres 

• familiarity and ability to work with network protocols 

• a degree in computer science would be highly relevant 


2. 

Facilitates User Access to digital data and text by. 

• working with Reference staff in all libraries to ensure 
that digital material accessed through the Data/Text 
Service is made available to users in a manner 
appropriate to their needs 

• preparing self-service interfaces for digital data/text. 

• working with faculty to create subsets of data for 
class/project use. 

• preparing user documentation related to the Service’s 
products and services. 

• instructing Library' staff and users in data/text retrieval 
and manipulation techniques. 

O . 


All of the above plus. 

• extensive knowledge of text retrieval software (e g. 

Open Text’s Livelink). 

• excellent oral and written communication skills 

• ability to interact in an efficient and pleasant manner 
with users at all level of computer and research 
expertise 

• ability to be flexible and adjust to changing 
technologies and priorities 

• training and presentation skills 

7g — 
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3. Takes an active role in ensuring that the University 
Library takes the best advantage of emerging digital 
services by: 

• participating, as a member of the Library’s Data 
Planning Group, in decisions concerning the selection 
of digital materials and the appropriate format of those 
materials. 

• anticipating new technology and working with 
reference staff to integrate such technology into the 
provision of reference service. 

• evaluating and recommending software and hardware 
requirements to ensure the ongoing development of the 
Data/Text Service. 

• participating in Library representation to ICPSR 
(Inter-University Consortium for Political & Social 
Research) , DLI (Data Liberation Initiative) and other 
such consortium bodies. 


All of the above plus: 

• demonstrable interest in evolving technology' 

• data management skills 

• time management skills 

• knowledge of the research and instructional needs (for 
digital data) of the academic community 

• ability to work effectively in a team environment, but 
with considerable independence. 

• problem solving and planning skills 


4. 




5. 




6. 





WP\forms\jobdescr 



The first two pages are available on disk from the Library Human Resources Office 

2 
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University of Virginia 



35% 



30% 



25% 



10% 



: Director, Geospatial and Statistical Data Center 



Administration 

Provides direction and leadership for geospatial and statistical data services within the 
Library and University. Coordinates Center activities with similar or related activities 
across the Library’s digital centers, digital library research program and other units as 
necessary. Evaluates Associate Director and Map Services Specialist (.5 FTE Library 
Assistant position within GIR); provides input to Associate Director for evaluation of 
Programmer/Analyst positions within Geostat. Assesses departmental training needs and 
assists others in assessing their own training needs. Prepares annual budget requests for 
student wages, collections, travel, training, and OTPS. Participates in the Information 
Technology Council and other Library-wide IT committees. 

Collections and Services 

Develops data collections in accordance with established collection development criteria. 
Consults with users on research and teaching projects requiring statistical and/or 
geospatial data, and promotes the use of data resources across the University. Works 
with digital library research and development team members on creating web-based data 
identification and retrieval tools. Supervises management of print map collections, 
atlases and related cartographic products, including collection development, organization, 
maintenance, and preservation. 

Public and Reference Services 

Provides reference and information service relating to government publications, social 
sciences data, and maps using print, microforms, and electronic information delivery 
systems. Serves as a resource for other Library faculty and staff with regard to the 
provision of geospatial and statistical data products and services. 

Professional Development 

Keeps abreast of and contributes to the professional and research literature in data 
librarianship and related fields. Attends and participates in professional meetings and 
conferences. Serves as Geostat and Library representative at regional, state, national and 
international meetings as needed. Assesses personal training needs and participates in 
training programs to meet those needs. 




ERLC 



88 



SPEC Kit 263 

Numeric Data Products and Services 



Access Policies 
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University of British Columbia 



http://data.library.ubc.ca/datalib/gen/copyright.html 




UBC Library CONTACT US | 



SITEMAP* 




General Licence Agreement Conditions 



Most of UBC Numeric Data Services' data files are acquired under a non-exclusive, non-assignable and 
non-transferable licence to use the data product. 

In consequence, access to these data files and dissemination of data retrieved from them is restricted to current 
faculty, staff or students employed by, or studying at UBC. UBC MAY provide limited World Wide Web 
(WWW) access to current students, staff, and faculty of other licensed Data Liberation Initiative (DLI) 
academic institutions for a limited selection of our data files. 

UBC CANNOT provide individual support to non-UBC users. Non-UBC users requiring support with our files or 
WWW services must contact their own educational institution's Library for assistance. 

The data files protected by these licence agreements are to be used only for teaching and for faculty or staff 
research, and in no circumstances for personal contract activities of faculty, staff or students. Use of these data 
files for any commercial purposes is prohibited. 

See the Data Liberation Initiative page for licence conditions applying to all Statistics Canada data sets. 



UBC users: 

Access restrictions are noted in Library catalogue records describing data files. Look under NOTES in the full 
record display and you’ll see something like: 

• UBC access only, or 

• All users, etc. 
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University of Pennsylvania 



http://www.library.upenn.edu/services/access/electronic.html 

^UNIVERSITY OF PENNSYLVANIA LIBRARY 

home access collections copying publications reserves 

hours borrowing computing doc delivery ref desk staff 
1 1 

£ LIBRARY ACCESS 



Section Index Access to Electronic Resources 



Access Policies 

► Library Access Policies 

► Alumni Use of the Library 

► Library 'Privileges for 
Visitors 

► Services for Patrons with 
Disabilities 

* Access to Electronic 
Resources 

► Use of Other Area 
Libraries 



Related Library Links 

► Penn's Libraries 

► Technical Help 

Related Penn Links 

► Directions to Penn 

► Parking on the Penn 
Campus 

► PennCards 

► Facilities Services - 
Barrier Free Access at 
Penn 

► Office of Affirmative 
Action and Equal 
Opportunity Programs 

Return to: 

► About the Library 

► About the Library A-Z 



The University of Pennsylvania is a private institution and its libraries are private 
libraries, maintained to support the teaching and research mission of the University. 
Use of the Library’s electronic resources is generally restricted to currently enrolled 
students, faculty and staff in order to comply with license agreements for educations 
use and to ensure access to limited resources. Restricted resources are marked with '< 
US on the Library Web. Visitors to the library are invited to use Franklin , the Librar 
online catalog, and to use the Library’s selection of unrestricted Internet Resources . 

• Franklin, the Library's Online Catalog: unrestricted. 

• Databases and Selected Reference Sources (e.g. MEDLINE, Britannica 
Online, ISI Citation Indexes): licensed for the use of faculty, staff, and 
currently enrolled students. 

In order to sign on to a database Penn faculty, staff and students must have a 
current PennCard. If you are not able to sign on to a restricted database conta 
your library’s circulation desk or call Van Pelt Circulation, (215) 898-7566. 

• CD-ROMs: In order to provide adequate access and support for research and 
study, use of compact disc databases in Van Pelt, Biomedical, and a number « 
other libraries is restricted to Penn faculty, staff, students and alumni. Use of 
compact disc databases received as depository documents is not restricted 



home hours help 15 libraries forms comments a-to-z patron searc 

Copyright © 1998-2001 University of Pennsylvania Library. All Rights Reserved. 



Last update: Tuesday, 1 6-May-2000 10:57:12 EDT 
Send mail concerning this page to: lihrarv(d)pobox. upenn. edu 
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Washington State University 



http://www.wsulibs.wsu.edu/MMR/hlda/hlda.htm 
griffin Catalog /Article Indcxcs/E-Journals/'iothcr Library Catalogs/#Your Rccord/Rcncwals\ 




HOLLAND LIBRARY DATA ARCHIVE 

The Holland Library Data Archive offers the Washington State University community access to a wide variety 
of numerical data resources. These have been acquired through the Library's United States and State of 
Washington depository programs, membership in the Inter-University Consortium for Political and Social 
Research (ICPSR) and other data gathering and distributing organizations. 

The Data Archive, located at Media Materials Services on the ground floor in Holland Library, serves as a 
local repository for machine readable data files. Currently the collection consists primarily of US Government 
and ICPSR Data Products . ICPSR Data Products may be downloaded directly on the WSU campuses. 
Access from off campus will be made available only through the HLDA via FTP . 



ACCESS and CIRCULATION POLICY: 

Because of licensing restrictions certain categories of data held or made available by HLDA are accessible ONLY TO CURRENT WSU 
STUDENTS, FACULTY, OR STAFF. Patrons wishing to gain access to the data files and documentation available at WSU or needing 
assistance with downloading information from ICPSR, should contact the HLDA staff in person during normal hours of operation, or 
electronically via the ICPSR Dataset Inquiry Form , or through E-Mail. ICPSR data resources can be made available to non- WSU 
patrons for a fee. 



How to Find Datafiles 



How WSU-Affiliated Users Can Obtain Datafiles 



Special Facilities and Support for WSU Users 



Links to Data Oriented Web Sites 




Links to Interactive Data Analysis 



FOR MORE INFORMATION, CONTACT: 

• Rvan Johnson . Official ICPSR Representative for WSU 
(johnsor@wsu. edu) 



Visitors since 6/14/99: 



05277 



Last updated on Wednesday July 25th, 2001. 






University of Waterloo 



http: / / tdr.ugelph.ca/cgi-bin/drc.cgi/ other 



| I AGREE I I DISAGREE 

Please read the following note on licensing. 

DATA ACCESS & USE RESTRICTIONS - TDR LICENCING AGREEMENTS 

Many of the datasets that are housed within TriUniversity Data Resources are subject to licensing agreements. 

As a rule, agreements stipulate that current students, faculty and staff of the University of Guelph, the University of Waterloo and 
Wilfrid Laurier University may use the data for academic and personal research only. Any use of these data for commercial 
applications or contract purposes will violate our licenses. Use by members of the community outside of these three 
institutions could also violate our licenses (please contact staff at any of the three Centres for more information). 

Some of the data may also be restricted to use by an individual university within the TriUniversities. This would be a result of 
single ownership by that member institution of a specific data set which is governed by a limited licensing agreement. 

In short, if you are currently a member of one of the three University communities (Student, Staff or Faculty), you may use the 
data available through the TriUniversity Data Resources for NON-COMMERCIAL applications such as teaching and 
academic research. If you are interested in our holdings, but are not a member of these University communities, we may be 
able to help you contact the appropriate organizations to arrange purchasing of the data for your own use. 

For more information, please feel free to contact members of TriUniversity Data Resources for details by clicking on the 
following LINK. 



Data Liberation Initiative (DLI) Agreement 



This site is best viewed with Netscape 2.0 or higher 
or Microsoft Explorer 4.0 or higher. 




Help us help you: 

Please fill in our User Survey Form 



Page Created and Maintained by 
Michelle Edwards : edwardsm@uoquelph.ca and 
Bo Wandschneider : bo@uoquelph.ca 
Please Send Comments to : drc@uoquelph.ca 
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